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KLARQUIST. SPARKMAN, CAMPBELL. LEIGH & 

WHINSTON. LLP 

ONE WORLD TRADE CENTER 

121 SW SALMON STREET* SUITE 1600 

PORTLAND OR 97204 


per 

WRITTEN OPINION 

(P( rr Rule 66) 


Date of Mailing 

(day/monUi/year) 27JUI ZOOtt 


AppUcant's or agent's file reference 
4630-53860 


REPLY DUE ... . 

within 1 WO nionihs 

from the above dale of maiUng 


International application No. 
PCT/US99/28655 


International filing date ((lny/monih/yctir) 
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Priority dale (day/monOtfyear) 
07 DECEMBER 1998 


International Patent Classification (IPC) or both national class irication and IPC 
Please See Supplemental Sheet. 


Applicant 

WASHINGTON STATE UNIVERSITY RESEARCH FOUNDATION 



1 . This written opinion is the 



first 



(first, etc.) drawn by this Inlemalional Preliminary Examining Authority. 



2. This opinion contains indications relating to the following items: 



I 




II 


□ 


III 


m 


IV 


0 


V 




VI 


□ 


VII 


□ 


VIII 


□ 



Reasoned statement under Rule 66.2(a)(ii) with regard to novelty, invciuivc sicp nr industrial applicability; 
citations and explanations supporting such statement 

Certain documents cited 



/ I I Certain observations on the international application 

3. The applicant is hereby invited to reply to this opinion. 
•When? 



^{ lh«l lim e l i mit, r e que t it ihin 



How? 
Also 



See the time limit indicated above. The appl i oanl may, bafnro Iho oKpirhl ii 
Authority to grant on e xt e noion.t noo Rulo 66i3(d)i 

By submitting a written reply* accompanied, where appropriate, by amcndmcnls. according lo Rule 66.3. 
For the form and the language of the amendments, see Rules 66.8 and 66.9. 



For an additional opportunity to submit amendments, sec Rule 66.4. 

For the examiner's obligation to consider amendments and/or arguincnis. sec Rule 66.4 bh. 
For an informal communication with the examiner, sec Rule 66.6. 
If no reply is filed, the international preliminary examination report will be cslahlislicd on liic ha.sis of this opinion. 

4. The final date by which the international preliminary addii mm 

examination report must be established according lo Rule 69.2 is: 07 APRIL -00 . 



' ■ 


Name and mailing address of the I PEA/US 
• . Commissioner of Patents and Trademarks 
Box PCT 

Washington. D.C. 2023 1 
FacsimUe No. (703) 305-3230 


NtANJU?Wt>l RAO ^^^^^ Y"^ 
TelepWnie No. (705) 308-0196 
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KLARQUIST, SPARKMAN, CAMPBELL, LEIGH & 

WHINSTON. LLP 

ONE WORLD TRADE CENTER 

121 SW SALMON STREET, SUITE 1600 

PORTLAND OR 97204 


PCX 

NOTIFICATION OF TRANSMITTAL OF 
INTERNATIONAL PRELIMINARY 
EXAMINATION REPORT 

(POT Rule 71.1) 


Date of Mailing 

(..y™„*^.. 22,!AN?nni - 


Applicant's or agent's file reference 
4630-53860 


IMPORTANT NOTIFICATION 


International application No. 
PCT/US99/28655 


Internationa! filing date (day /month/year) 
06 DECEMBER 1999 


Priority Date (day/month/year) 
07 DECEMBER 1998 


Applicant 

WASHINGTON STATE UNIVERSITY RESEARCH FOUNDATION 



4:v 



1 . The applicant is hereby notified that this International Preliminary Examining Authority transmits herewith the 
international preliminary examination report and its annexes, if any, established on the international application. 

2. A copy of the report and its annexes, if any, is being transmitted to the International Bureau for communication 
to all the elected Offices. 



3. Where required by any of the elected Offices, the International Bureau will prepare an English translation of 
the report (but not of any annexes) and will transmit such translation to those Offices. 



4. REMINDER 

The applicant must enter the national phase before each elected Office by performing certain acts (filing 
translations and paying national fees) within 30 months from the priority date (or later in some Of rices)( Article 
39(l))(see also the reminder sent by the International Bureau with Form PCT/IB/301). 

Where a translation of the international application must be furnished to an elected Office, that translation must 
contain a translation of any annexes io the international prelimmar>' examination report. It is the applicant's 
responsibility to prepare and furnish such translation directly to each elected Office concerned. 

For further details on the applicable time limits and requirements of the elected Offices, see Volume II of the 
PCT Applicant's Guide. 
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Name and mailing address of the IPEA/US 

Commissioner of Patents and Trademarks 
Box PCT 

Washington. D.C 20231 
Facsimile No. (703) 305-3230 


Authorized officer / / jjl ( 

MANJUNATH RAO COUJNS 
Telephone No. (703) 308-B<^6AL SPEOAl^ 

' IhOHNOLOaV CCMTCn 1600 ' 



^Itent cooperation TRE^pr 

PCX 

INTERNATIONAL PRELIMINARY EXAMINATION REPORT 
(PCT Article 36 and Rule 70) 



Applicant's or agent's file reference 
4630-53860 


^pp Notification of Transmittal of International 

FOR FURTHER ACTION 


International application No. 
PCT/US99/28655 


International filing date (day/month/year) 
06 DECEMBER 1999 


Priority date (aay/monm/yearj 
07 DECEMBER 1998 


International Patent Classification (IPC) 
Please See Supplemental Sheet. 


or national classification and IPC 




^wSmGTON STATE UNIVERSITY RESEARCH FOUNDATION 



This international preliminary examination repon na^ "-^"y'^^;'-"^. "T 
Examining Authority and is transmittet^^e applicant according to Article 36. 

This REPORT consists of a total of i^. sheets. 

• J K„ AMMPYF<; i e sheeU of the description, claims and/or drawings which have 

^ITte 70.16 .nd Seot.on 607^= Adminismtive Imtn«l,o,» the PCT). 
These annexes consist of a total of C-<^ sheets. . ^ 



3. This report contains indications relating to the following items: 



I 




II 


□ 


III 


[3 


IV 


□ 


V 




VI 


□ 


VII 


□ 


vm 


□ 



IV Q Lack of unity of invention 

Reasoned statement under Article 35(2) with regard t 
citations and explanations supportmg such statement 



Date of submission of the demand 
/^MAY 2000 



Name and mailing address of the IPEA/US 
Commissioner of Patents and Trademarks 
Box PCT 

Washington, D.C. 20231 
Facsimile No. (703) 305-3230 



Form PCT/IPE A/409 (cover sheet) (July 1998)^ 



Date of completion of this report 
04 DECEMBER 2000 



Authorized officer 

MANJUNATH RAO 
Telephone No. (703) 3; 



,oPjy^MAECOUJNS 



J^CHNOLOGY CENTER 1600 



'INTERNATIONAL PRELIMINARY EXAMINATION REPORT 



Intei^^hal application No. 
PCT/US99/28655 



L Basis of the report 



1. With regard to tfie elements of the international application:* 
the international application as originally filed 
rrn the description: 

^ pages 

pages NONE 



_ , as originally filed 
filed with the demand 



pages 



NONE 



, filed with the letter of 



Sthe claims: 
39-41 

pages 

pages NONE 

pages NONE 

pages NONE 



, as originally filed 

, as amended (together with any statement) under Article 19 

filed with the demand 



filed with the letter of 



nn the drawings: 

' — ' 1-21 
pages 

pages 

pages NQN^ 



NONE 



, as originally filed 

, filed with the demand 



the sequence listing part of the description: 

*— ' i_7 

pages LJ. 

pages NONE ^ 

pages NONE , 



, filed with the letter of . 



, as originally filed 



, filed with the demand 



filed with the letter of . 



2 W.th regani to the langMag.. all the elements marked above ^vere available or fished to this Authority m .he larrguage in which 
the inteSor^l applLt^n was filed, unless oAerw,^ mdicate^ •¥>^r*is tern - which is: 

TT^ese elements were available or ftimished to this Authonty in the following language _ _ 

□ the language of a translation furnished for the purposes of intemational search (under Rule 23.1(b)). 

□ the language of publication of the intemational application (under Rule 48.3(b)). 

□ the language of the translation furnished for the purposes of international preliminary examination (under Rules 55.2 and/ 
or 55.3). 

3 With regard to any nucIeoHde and/or amino acid sequence disclosed in the intemational application, the mtemational 
preliminary examination was carried out on the basis of the sequence listing: 
fx] contained in the intemational application in printed form. 

filed together with the intemational application in computer readable form. 
I I furnished subsequently to this Authority in written form. 
r~l furnished subsequently to this Authority in computer readable form. 

n The statement that the subsecuently finished ^^T.tten sequence listmg does not go beyond the disclosure in the 
I I international application as filed has been furnished. 

n The statement that the mfoimation recorded in computer readable fom, is identical to the vvriten sequence listing has 
' — ' been furnished. 
4 [xj The amendments have resulted in the cancellation of: 

m - NONE 

LiJ the description, pages _ 

[1 the claims. Nos. NONE 



fx] the drawings, sheets/fig NONE 

5 . □ ™s repon has been dra^.. as if (some oO the amendments had not b-n made since .hey have been considered to go 
^ beyond the disclosure as filed, as indicated in the Suppl^ental ^ox ^^^^70^2^)^ ^ 

.:l'r;!,Lm.n, s^e' r...."-"" ^>^ri. an.en dm.nn m,.. „e r eferred ,o unrier item I and annexed to this report. 
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InVRtional application No. 
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IIL Non-establishment of opinion with regard to novelty, inventive step and industrial applicabiiity 



1 . The questions whether the claimed invention appears to be novel, to involve an inventive step (to be non obvious), or to be 
industrially applicable have not been and will not be examined m respect of: 

I [ the entire international application. 

[x\ claims Nos. 12-13. 19. 22-27 

because: 

r— I the said international application, or the said claim Nos. . relate to the following subject matter which 
I I does not require international preliminary examination (specify). 



rn the description, claims or drawings (wdicate particular elements below) or said-claims Nos. . are so 
— unclear that no meaningful opinion could be formed (specify). 



j— j the claims, or said claims Nos. _ are so inadequately supported by the description that no meaningful 
— opinion could be formed. 

[U no international search report has been established for said claims Nos. 12^3^19^22^. 



2 A meaningful mtemational pi^liminaiy examination cannot be cairied out due to the failure of the nucleotide and/or amino acid 
■ sequence listing to comply with the standard provided for in Annex C of the Administrative Instmcttons: 

□ the written form has not been furnished or does not comply with the standard. 

□ the computer readable form has not been furnished or does not comply with the standard. 
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Intei^^pia 



Ral application No. 
PCT/US99/28655 



IV. Lack of unity of invention 



1. In response to the invitation to restrict or pay additional fees the applicant has: 



restricted the claims. 



□ 

["3c] paid additional fees. 
j I paid additional fees under protest. 
I [ neither restricted nor paid additional fees. 

2 r-l This Authority found that the requ.rement of un>ty of invention .s not comphed with and chose, accord.ng to Rule|68.1 

I— ' not to invite the applicant to restrict or pay additional fees. 



3. -ms Authonty consider, that the requirement of unity of invention in accordance v«th Rules 13.1. 13.2 and 13.3 is 
I [ complied with. 

[~1 not complied with for the following reasons: 



4. Consequently » the following parts 
in establishing this report: 



of the international application were the subject of international preliminary examination 



I I all parts. 

Q the parts relating to claims Nos. . MJ^l4a8„and.20^ 
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V Reasoned staten^ent under Article 35(2) .Uh regard to novelty, inventive step or industrial applicability; 
citations and explanations supporting such statement 



statement 
Novelty (N) 

Inventive Step (IS) 
Industrial Applicability (lA) 



Claims NONE 

Claims 1-11- 14-18. 20.21 



Claims NONE 



Claims 1-11, 14-18. 20-21 



Claims 1-11. 14-18,20-21 



Claims NONE 



YES 
NO 

YES 
NO 



YES 
NO 



98/45461, 15 October 1998). ■ . 

(WO 96/21022. . . M. 1996) o, Knmon el a.. (WO „ <,d„.,2 ieii»»a». .a<l 



NEW CITATIONS 

WO 96/21022 A2 (THOMAS et al.) 1 1 JULY 1996. see entire document. 

WO 98 467^ Al KNUTZON et al.) 22 OCTOBER 1998, see ent.re document. 

Zo 98 45461 Al THOMAS et al.) 15 OCTOBER 1998. see entire document. 
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Supplemental Box ... . ft- „.\ 

(To be used when the space in any of the preceding boxes is not sufficient) 



Continuation of: Boxes I - VIII 



Sheet 10 



CLASSIFrcATION^_^^^^ Patent Classification (IPC) and/or the National classification are as listed ^ 

IPC(7): C12N 01/10. 01/12, 9/00, 15/06. 15/09, 15/12, 15/30 and US CI.: 435/134. 189. 254.2. 325. 320.1, 536/23.2 



Form PCT/IPE A/409 (Supplemental Box) (July 1998)* 



INTERNATIONAL SEARCH REPORT 


International application No. 




PCT/US99/28655 



Box 1 Obrervations where certain claims were found unsearchable (Continuation of item 1 of first sheet) 



This international repoit has not been established in respect of certain claims under Article 17(2)(a) for the following reasons: 
Claims Nos.: 

because they relate to subject matter not required to be searched by this Authority, namely: 



Claims Nos.: 

because they relate to parts of the international application that do not comply with the prescribed requirements to such 
an extent that no meaningful international search can be carried out, specifically: 



Claims Nos.: 

because they are dependent claims and are not drafted in accordance with the second and third sentences of Rule 6.4(a). 



Box 1! Observations where unity of invention is lacking (Continuation of item I of first sheet) 
This International Searching Authority found multiple inventions in this international application, as follows: 
Please Sec Extra Sheet. 



As all required additional search fees were timely paid by the applicant, this international search report covers all searchable 
claims. 

As all searchable claims could be searched without effort justifying an additional fee, this Authority did not invite payment 
of any additional fee. v 

As only some of the required additional search fees were timely paid by the applicant, this international search report covers 
only those claims for which fees were paid, specifically claims Nos.: 



No required additional search fees were timely paid by the applicant. Consequently, this international search report is 
restricted to the invention first mentioned in the claims; it is covered by claims Nos.: 



Remark on Protest | ) The additional search fees were accompanied by the applicant's protest. 

\ X| No protest accompanied the payment of additional search fees. 

Form PCT/ISA/210 (continuation of first shect(l))(July 1992)* 



INTERNATIONAL SEARCH REPORT 



International application No. 
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BOX 11. OBSERVATIONS WHERE UNITY OF INVENTION WAS LACKING 
This ISA found multiple inventions as follows: 

This application contains the following inventions or groups of inventions which are not so linked as to form a single 
inventive concept under PCT Rule 13.1. In order for all inventions to be searched, the appropriate additional search 
fees must be paid. 

Group I, claim(8)l, 9, and 18, drawn to desaturase enzyme. 

Group II, claim<s) 2-8, 10-11, 14-17 crd 20-21, drawn to polynucleotides, vectors and host cells. 

Group III, claim 12, drawn to a transgenic organism. 

Group rV, claims 13 and 19, drawn to specific binding agent. 

Group V, claim 22, drawn to a method of creating a double bond in a fatty acid. 

Group VI, claims 23-24, drawn to a method creating a double bond in a fatty acid under 'u». \ conditions. 

Group VII, claim 25, drawn to a method of creating a double bond in a fatly acid under in vitro conditions. 

Group VIII, claim 26-27, drawn to a method of creating a double bond in a fatty acid using a second desaturase 
enzyme. 

Tlie inventions listed as Groups I- VIII do not relate to a single inventive concept under PCT Rule 13.1 because, under 
PCT Rule 13.2, they lack the same or corresponding special technical features for the following reasons: 

Group I is a product; this shares the special technical feature of an enzyme, which groups II-VUl do not share. 



Group II is a product; this shares the special technical feature of DNA molecules, which groups I and III-VIII do not 
share. 

Group III is a product; this shares the special technical feature of a transgenic organism, which groups I-II and IV-VIII 
do not share. 

Group rv is a product; this shares the special technical feature of a chemical compound, which groups I-II I and V-VIII 
do not share. 

Group V is a general method; this shares the special technical feature of a biochemical reaction, which groups I -IV and 
VI-VIII do not share. 

Group Vl.is a specific method; this shares the special technical feature of in yivo^ reaction, which groups I-V and VII- 
VIII do not share. v 

Group VII is a specific method; this shares the special technical feature of in vitro chemical reaction, which groups I -VI 
and VIII do not share. 

Group VI 11 is a specific method; this shares the special technical feature of an enzymatic reaction, which groups 1-V1I 
do not share. 
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Assistant Connmissioner for Patents 

1 InitoH Qtatoc Patpnt anH TraHpmflrW 
UiiilcU Oldlco rdlcMl ailU i laUCiMaii^. 

Office 
Box PCT 

Washington, D.C.20231 
ETATS-UNIS D'AMERIQUE 

in its cspacity as elected Office 


Date of mailing {day/month/year) 
20 July 2000 (20.07.00) 




International application No. 
PCT/US99/28655 


Applicant's or agent's file reference 
4630-53860 


International filing date (day/month/year) 

06 December 1999 (06.12.99) 


Priority date (day/month/year) 

07 December 1998 (07.12.98) 


Applicant 

BROWSE, John, A. eta! 





1. The designated Office is hereby notified of its election made: 

j X 1 In the demand filed with the International Preliminary Examining Authority on: 

16 May 2000(16.05.00) 






1 1 in a notice effecting later election filed with the International Bureau on: 

2. The election | X | was 

I 1 was not 

made before the expiration of 1 9 months from the priority date or, where Rule 32 applies, within the time limit under 
Rule 32.2(b). 







Authorized officer 


The International Bureau of WlPO 




34. chemin des Colombettes 


Pascal PIriou 


1211 Geneva 20, Switzerland 




Facsimile No.: (41-22) 740.14,35 


Telephone No.: (41-22) 338.83.38 
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A. CLASSIFICATION OF SUBJECT MATTER 
IPC(7) :C12N 01/10. 01/12, 9/00. 15/06. 15/09, 15/12. 15/30 
US CL : 435/134. 189, 254,2, 325. 320.1; 536/23.2 
According to International Patent Classification (IPC) or to both national classification and IPC 



a FIELDS SEARCHED 



Minimum documentation searched (classification system followed by classification symbob) 
U.S. : 435/134, 189, 254.2. 325. 320.1; 536/23.2 



Documentation searched other than minimum documentation to the extent that such documents are included in the fieHs searched 



Electronic data base consulted during the international search (name of data base and, where practicable, search terms used) 
REGISTRY, CA, CAPLUS, BIOSIS, EMBA3E, MEDLINE, A0^UCO^A, TOXLIT, USPATFULL 



DOCUMENTS CONSIDERED TO BE RELEVANT 



Category* 



A, Y 



Citation of document, with indication, where appropriate, of the relevant passages 



US 5,057,419 (MARTIN et al.) 15 October 1991, see entire 
document. 



Relevant to claim No. 



1-11, 14-17,20-21 



I [ Further documents are listed in the continuation of Box C. | | See patent family annex. 



* Speciml categortes of cited dociunenu: 

*A* document defoiing the general state of the art which it not considered 

to be of p«rticulmr relevance 

*E* earlier document published on or after the international filing date 

'L* doctunent which may throw doubts on priority claim (s) or which is 

cited to establish the publiealion date of another citation or other 
special reason (as specified) 

*0* document referrmg to an oral disclosure, use. exhibition or other 

means 

*P* document published prior to the tntemational filing date but later than 
the priority date claimed 



later document published after the international filing date or priority 
date and not in conflict with the application but cited to understand 
the principle or theory underlying the invention 

document of particular relevance; the claimed invention cannoi be 
coiuidered novel or cannot be considered to involve an inventive step 
when the document is taken alone 

document of particular relevance; the claimed invention cannot be 
considered to involve an inventive step when the doctmiem is 
combined with one or more other such doctmtcnts. such combination 
being obvious to a person skilled in the art 

doctmient member of the same patent family 



Date of the actual completion of the international search 
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The amino acid and nucleic acid sequences of a A^-desaturase enzyme and a A^-desaturase enzyme arc disclosed. TTie nucleic 
acid sequences can be used to design recombinant DNA constructs and vectors. These vectors can then be used to transform various 
organisms, including for example, plants and yeast. The transformed organisms will then produce polyunsaturated fatty acids. The amino 
acid sequences are useful for generating enzyme-specific antibodies that are useful for identifying the desaturases. 
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DESATURASES AND METHODS 
OF USING THEM FOR SYNTHESIS OF 
POLYUNSATURATED FATTY ACIDS 

5 FIELD OF THE INVENTION 

The invention relates to desaturase enzymes that can be used to produce 
polyunsaturated fatty acids with important dietary applications. 

BACKGROUND 

1 0 Fatty acids are fundamental components of living systems. They make up the major 

component of cytoplasmic membranes, common to plants, animals and protists alike. 

Fatty acids of 20 carbons, with more than one unsaturated carbon-carbon bond 
along the hydrocarbon chain, are known to be of particular importance. Arachidonate (20:4) 
(Heinz, Lipid Metabolism in Plants, pp. 33-89, 1993; Yamazaki et al. Biocliim, Bioptiys. Acta 

15 1123:18-26, 1992; Uisamer et a!.. J. Cell Biol. 43:105-1 14, 1969; and Albert et aL Lipids 

14:498-500, 1979) and eicosapentaenoate (20:5) (Heinz, Lipid Metabolism in Plants, pp. 33- 
89, 1993; Yamazaki et aL, Biochim. Biophys, Acta 1123:18-26, 1992; Uisamer et a!., J. Cell 
Biol. 43:105-114, 1969; Albert et al. Lipids 14:498-500, 1979; and Cook et a!., J. Lipid Res. 

32:_1 265^1 273. 1991). commonly referred to as EPA , are si gnificant components of 

20 mammalian cell membranes and are also precursors of signal molecules including 
prostaglandins. Certain specialized mammalian tissues such as brain (Naughton, J. 
Biochem. 13:21-32, 1981), testes (Wilder and Conigiio, Proc. Soc. Exp. Biol. Med. 177:399- 
405, 1984), and retina (Aveidano de Caidironi et al., Prog. Lipid Res. 20:49-57, 1981) are 
especially rich in unsaturated fatty acids. 

25 Arachidonate and eicosapentaenoate serve both as precursors for synthesis of 22- 

carbon polyunsaturated fatty acids and, with dihomo-y-linoleate (20:3) (Yamazaki et al., 
Biochim. Biophys. Acta 1123:18-26, 1992; Uisamer etaL, J. Cell BioL 43:105-114, 1969; and 
Albert et al., Lipids 14:498-500, 1979), as precursors to the synthesis of eicosanoid 
metabolic regulators (Hwang, Fatty Acids in Foods and Their Health Implications, 545-557, 

30 1992). Key enzymes in the synthesis of 20-carbon fatty acids are desaturases, which 

introduce cis double bonds by removing two hydrogen atoms at specific locations along the 
aliphatic hydrocarbon chains. Desaturase enzymes are specific to the position, number, and 
stereochemistry of the double bonds already present in the target fatty acid (Heinz, Lipid 
Metabolism in Plants, 33-89, 1993). 

35 To synthesize 20-carbon polyunsaturated fatty acids, mammals must acquire the 

essential fatty acids 18:2 (Brenner, The Role of Fats in Human Nutrition, pp. 45-79, 1989) 
and 18:3 (Nelson, Fatty Acids in Foods and Their Health Implications, pp. 437-471, 1992; 
Brenner. The Role of Fats in Human Nutrition, pp. 45-79, 1989; and Hulanicka et al. J. Biol. 
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Chem. 239:2778-2787, 1964) from their diet (Nelson, Fatty Acids in Foods and Their Health 
Implications, 437-471,1992). These dietary polyunsaturated fatty acids are metabolized in 
the endoplasmic reticulum by an alternating series of position-specific desaturases and 
malonyi-CoA-dependent chain-elongation steps (FIG. 1A), which results in the characteristic 
5 methylene-interrupted double bond pattern. In the !iver, which is the primary organ of human 
lipid metabolism, the first step in biosynthesis of 20-carbon fatty acids is desaturation of the 
essential fatty acids at the A® position. The desaturation products are elongated to 20:3 and 
20:4 (Cook et aL, J. Lipid Res. 32:1265-1273, 1991). In tum, these 20-carbon products are 
desaturated by a A^-desaturase to produce arachidonate and eicosapentaenoate. The A^- 

10 desaturation step is rate-limiting in this metabolic pathway (Bernet and Sprecher, Biochim. 

Biophys. Acta 398:354-363. 1975; and Yamazaki et aL, Biochim. Biophys. Acta 1123:18-26, 
1992) and, not surprisingly, is subject to regulation by dietary and hormonal changes 
(Brenner, The Role of Fats in Human Nutrition, pp. 45-79, 1989). 

In contrast to the liver, an altemate pathway for biosynthesis of 20-carbon 

15 polyunsaturated fatty acids has been demonstrated in a few organisms and tissues (FIG. 
IB). Instead of desaturation, the first step in the altemate pathway is elongation of the 
essentia! 18-carbon fatty acids to 20-carbon chain lengths, producing 20:2 (Ulsamer et al., J. 

Ce//-B/Q/.JI3:105^114. 1969; and Albert et ai. Lipids 14:498-500, 1979) and 20:3. 

Subsequent desaturation occurs via a A°-desaturase (FIG. 1). The products of this 

20 elongation-desaturation, 20:3 and 20:4. are the same as the more usual desaturation- 
elongafion pathway. The A® pathway is present in the soil amoebae Acanthamoeba sp. 
(Ulsamer et al, J. Ceil Biol. 43:105-1 14, 1969), and in euglenoid species, where it is the 
dominant pathway for formation of 20-carbon polyunsaturated fatty acids (Hulanicka et a!., 
Journal of Biological Chemistry 239:2778-2787, 1964). 

25 This A®-desaturation pathway occurs in mammals, both in rat testis (Albert and 

Coniglio, Biochim. Biophys. Acta 489:390-396. 1977) and in human testis (Albert et al.. 
Lipids 14:498-500, 1979). While A® activity has been observed in breast cancer cell lines 
(Grammatikos et al., Sr. J. Cancer 70:219-227, 1994; and Bardon et al., Cancer Lett. 99:51- 
58, 1996) and in glioma (Cook etal., J. Upid Res. 32:1265-1273, 1991), no A® activity is 

30 detectable in a corresponding non-cancerous breast cell line (Grammatikos et al., Br. J. 

Cancer 70:219-227, 1994) or in the brain (Dhopeshwari<ar and Subramanian, J. Neurochem. 
36: 1 1 75-1 1 79, 1 976). The significance of A^-desaturation to normal or cancer cell 
metabolism is unclear, since analysis of desaturase activities in mammalian systems is 
frequently complicated by the presence of competing A® reactions and chain-shortening 

35 retroconversion of fatty acid substrates in tissue (Sprecher and Lee. Biochim. Biophys. Acta 
388:113-125, 1975; Geigeret al.. Biochim. Biophys. Acta 1170:137-142, 1993). 
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Polyunsaturated 20-carbon fatty acids are. for the reasons outlined above, important 
in the human diet, and there has been considerable recent interest in incorporating such 
fatty acids into infant food, baby formula, dietary supplements, and nutriceutical formulations. 

It would therefore be desirable to produce new transgenic plants and animals with 
enhanced ability to produce polyunsaturated 20-carbon fatty acids. 



SUMMARY OF THE DISCLOSURE 

The invention provides novel A^- (FIG. 6A) and A® - (FIG. 7A) desaturase enzymes 
that may be cloned and expressed in the ceils of various organisms, including plants, to 
produce 20-carbon polyunsaturated fatty acids. Expression of such fatty acids enhances the 
nutritional qualities of such organisms. For instance, oil-seed plants may be engineered to 
incorporate the A® - and A®-desaturases of the invention. Such oil-seed plants would 
produce seed-oil rich in polyunsaturated 20:3, 20:4, 20:5, 22:4, and 22:5 fatty acids. Such 
fatty acids could be incorporated usefully into infant formula, foods of all kinds, dietary 
supplements, nutriceutical. and pharmaceutical formulations. 

The invention also provides proteins differing from the proteins of FIG. 6A and FIG. 
7A by one or more conservative amino acid substitutions. Also provided are proteins that 
exhibit "substantial similarity" (defined in the "Definitions" section) with the proteins of FIG. 
6A and FIG. 7A. 

20 The invention provides isolated novel nucleic acids that encode the above- 

mentioned proteins, recombinant nucleic acids that include such nucleic acids and cells, and 
plants and organisms containing such recombinant nucleic acids. 

The novel A^ • and A® -desaturase enzymes can be used individually, or in 
conjunction with one another, for instance in a metabolic pathway, to produce 
25 polyunsaturated fatty acids, such as 20:3. 20:4, 20:5, 22:4, and 22:5 fatty acids. 

The scope of the invention also includes portions of nucleic acids encoding the novel 

A^ - and 

A°-desaturase enzymes, portions of nucleic acids that encode polypeptides substantially 
similar to these novel enzymes, and portions of nucleic acids that encode polypeptides that 

30 differ from the proteins of FIG. 6A and FIG. 7A by one or more conservative amino acid 
substitutions. Such portions of nucleic acids may be used, for instance, as primers and 
probes for research and diagnostic purposes. Research applications for such probes and 
primers include the identification and cloning of related A^ - and 
A®-desatu rases in other organisms including humans. 

35 The invention also includes methods that utilize the A^ - and/or the A® -desaturase 

enzymes of the invention. An example of this embodiment is a yeast or plant cell that carries 
genes for one or both desaturases of the invention and that, by virtue of these desaturases. 
is able to produce arachidonic acid and/or EPA. 



10 



15 



wo 00/34439 



PCTAJS99/28655 



BRIEF DESCRIPTION OF THE DRAWINGS 

FIG. 1 A shows a common pathway for synthesis of 20-carbon polyunsaturated fatty 
acids that begins with A® desaturation of 18-carbon fatty acids followed by 2-carbon 
5 elongation, and then further desaturation and elongation. 

FIG. 1B shows an altemate pathway that begins with an elongation of 18-carbon 
fatty acid to 20-carbon fatty acids, followed by A° desaturation and a second desaturation at 
the position. 

FIG. 1C shows alternate pathways for the synthesis of polyunsaturated fatty acids 

1 0 using A^-, a\ and A® -desaturases to produce arachidonic acid and EPA. 

FIG. 2 shows gas chromatographic (GC) analysis of fatty acid methyl esters from E. 
gracilis grown (heterotrophically) in the dark with sucrose as carbon source. Fatty acids 
were identified by comparison of retention times with known standards. Significant peaks 
are numbered with their retention times and proportion of the total fatty acid indicated. 

15 FIG. 3 shows amino acid sequence similarities between the Euglena A°-desaturase 

protein (EFD1) and the desaturase enzymes of C. elegans. The deduced amino acid 
sequence of the EFD1 gene shows similarity with the C. elegans A® (FAT-3) and A^ (FAT-4} 

desaturases (Napier et ai., Biochem. J, 330:61 1-614, 1998). The similarity is strongest in 

the regions of conserved function. In the N-terminal region amino acids forming a 

20 cytochrome bs -like domain (Lederer, Biochimie 76:674-692, 1994) are indicated. The His- 
box motifs indicated by underlined characters are present in other identified membrane 
desaturases (Napier et a!., Biochem. J, 330:61 1-614, 1998; Michaelson et a!., J. Biol. Cham. 
273:19055-19059, 1998; and Shanklin and Cahoon, Anna, Rev. Plant Physiol. Plant Moi 
Biol. 48:611-641, 1998). 

25 FIG. 4 shows the results of gas chromatography of fatty acid methyl esters from 

recombinant yeast. Cultures of yeast containing either control pYES2 or pYES2-541 , which 
expresses the Euglena 

A°-desaturase {EFD1) gene, were supplemented with the indicated 20-carbon fatty acids. 

The control strain does not desaturate the exogenous fatty acids. For the experimental 
30 strain, an arrow indicates the desaturation peak. 

FIG. 5 shows the results of mass spectrometry (MS) of desaturation products. 

DMOX derivatives of EFD1 desaturation products were analyzed by GC-mass spectrometry. 

The molecular ion of each fatty acid is 2 a.m.u. (atomic mass units) less than the substrate 

provided, as expected for insertion of a double bond. Desaturation at the A^ position is 
35 established by characteristic m/z peaks of 182 and 194 for each product, indicated by the 

bracket. 
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FIG. 6A shows the primary amino acid sequence of the fatty acid A^-desaturase 
from Caenorhabditis elegans, 

FIG. 6B shows a nucleotide sequence including the ORF (open reading frame) that 
encodes the fatty acid A^-desaturase from Caenorhabditis elegans. 
5 FIG. 7A shows the primary amino acid sequence of the fatty acid A®-desaturase 

from the protist Euglena graciiis. 

FIG. 7B shows a nucleotide sequence including the ORF that encodes the A^- 
desaturase from the protist Eugtena gracilis. 

FIG. 8 shows certain features of the structure of the C. elegans A^ - and A®- 
1 0 desaturase genes. The relative location of gene products T1 3F2.1 and W08D2.4 on their 
respective cosmids is shown above. The exon structure of T13F2.1 (faM) and W08D2.4 
{fat'3) showing the sites of sequences encoding the SL1 splice site, the heme-binding motif 
of cytochrome b5 (cyt bs), and the three conserved histidine box motifs (HBX) is shown 
below. 

1 5 FIG. 9 shows a comparison of the predicted amino acid sequences of the borage 

A^-desaturase (bordS), C. elegans FAT-3 (fat3), C. elegans FAT-4 (fat4), and the Mortierella 

aipina A^-{mord5) desaturase. Identical or conserved residues are shaded, and the 

conserved-*HPGG-heme=binding-domain'and-the-conserved-histidine-b^^ — 

Abbreviations: bord6 = Borago officinalis 

20 A^-desaturase (GenBank accession number U79010); fat4 = C. elegans FAT-4 desaturase; 
fat3 = C. elegans A desaturase sequence of W08D2.4 (GenBank accession number 
Z70271), edited to remove amino acids 38-67, on the basis of the cDNA sequence; mordS = 
Mortierella aipina A^ desaturase (GenBank accession number AF054824). 

FIGS. 10A-10C show identification of arachidonic acid in transgenic yeast by gas 

25 chromatography-mass spectroscopy (GC-MS). Fatty acid methyl esters of total lipids of S. 
cerevisiae grown for 16 hours under inducing conditions (2% galactose) supplemented with 
0.2 mM di-homo-Y-linolenic acid were analyzed by GC-MS. (A) Yeast transformed with 
(empty) vector pYES2. (B) Yeast transformed with pYES2 vector carrying fat-4. The 
common peaks were identified as 16:0 (11.19-11.12 min.), 16:1 (11.38 min.), 18:0(13.07- 

30 13.08 min.). 18:1 (13.29 min.). 20:3 (11.64-11.65 min.). The novel peaks are arachidonic 

acid (14.49 min.) and 18:2 (12.91 min.). (C) The mass spectnjm of the peak eluting at 14.49 
min. This spectrum is indistinguishable from that of authentic methyl-arachidonate. 

FIGS. 11A and 11B show the novel desaturation products from substrates lacking a 
L double bond. (A) Partial GC trace of fatty acid methyl esters derived from yeast 

35 expressing the faM 
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A^-desaturase supplemented with 20:2A^^'^'* (14.81 min.). The desaturation product of this 

5,11,14 

substrate elutes at 14.62 min. and has been identified as 20:3A . (B) Partial GC trace of 
yeast expressing the /iaf-4 

A^-desaturase supplemented with 20:3A^^'^'*'^^ (14.87 min.). The desaturation product of this 

5.11.14.17 

5 substrate elutes at 14.69 min. and has been identified as 20:4A 

5 6 

FIG. 12 is a table comparing the substrate specificities of C. elegans A " and A " 
desaturases. 

FIG. 13 is a table comparing incorporation and desaturation of fatty acids by yeast 
strains transformed with a control construct pYES, and with pYES-541, a clone containing 
1 0 EGD1 , the E. gracilis A®-desaturase gene. S. cervisiae strains containing a control vector 
(pYES) or expressing EFD1 (pYES-541) were cultured in the presence of the indicated fatty 
acids. The cultures were harvested, washed, and methyl esters prepared from total cells 
and analyzed by GC. The weight % of total fatty acid methyl esters is indicated. 

15 SEQUENCE LISTING 

The nucleic and amino acid sequences listed in the accompanying sequence listing 
are shown using standard letter abbreviations for nucleotide bases, and three-letter code for 
amino acids. Only one strand of each nucleic acid sequence is shown, but the 
complementary strand is understood to be included by any reference to the displayed strand. 

SEQ ID NO: 1 is the nucleotide sequence corresponding to the open reading frame 
of the fatty acid A^-desaturase from Caenoiiiabditis elegans. 

SEQ ID NO: 2 is the primary amino acid sequence of the fatty acid A^-desaturase 
from Caenorhabditis elegans. 

SEQ ID NO: 3 is the nucleotide sequence corresponding to the open reading frame 
of the fatty acid A°-desaturase from the protist Euglena gracilis. 

SEQ ID NO: 4 is the primary amino acid sequence of fatty acid A®-desaturase from 
the protist Euglena gracilis. 

SEQ ID NOs: 5-8 are primers used to amplify and clone the A®-desaturase- 
encoding nucleic acid sequence. 

SEQ ID NO: 9 is a polyadenylation signal. 

SEQ ID NO: 10 is a primer used to amplify and clone the A^-desaturase-encoding 
nucleic acid sequence. 

SEQ ID NO: 1 1 is a short RNA leader sequence. 
SEQ ID NO: 12 is the amino acid sequence of a histidine box motif. 
SEQ ID NO: 13 is the amino acid sequence of a histidine box motif 



20 



25 



30 



35 
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DESCRIPTION OF THE iNVENTION 

The following definitions and methods are provided to better define the present 
invention and to guide those of ordinary skill in the art in the practice of the present invention. 
Unless othenwise noted, temis are to be understood according to conventional usage by 
5 those of ordinary skill in the relevant art. Definitions of common terms in molecular biology 
may also be found in Rieger et ai , Glossary of Genetics: Classical and Molecular, 5th 
edition. Springer-Verlag: New York. 1991; and Lewin. Genes W. Oxford University Press: 
New York, 1997. The nomenclature for DNA bases as set forth at 37 C.F.R. § 1 .822 is 
used. The standard one- and three-letter nomenclature for amino acid residues is used. 

10 

Definitions 

Portion: A portion of a nucleic acid molecule is a stretch of contiguous nucleic 
acids corresponding to the sequence of that molecule that may be about 1 5, 20. 30. 40, 50, 
or 60 nucleic acids in length. Such nucleotide portions may be used as probes or primers. 

15 A portion of a protein is a stretch of contiguous amino acids corresponding to the amino acid 
sequence of that protein that may be about 5. 10, 20, 30. 40, or 50 residues in length. As 
used herein, such a portion may correspond to any segment of a nucleic acid molecule, for 

instance such a portion may correspond to a segment consisting of nucleotides 1-500, 501- 

1000, or 1001-1451 of the sequence shown in FIG. 6B, or nucleotides 1-400, 401-800, 801- 

20 1251 of the sequence shown in FIG. 7B. 

Desaturase: A desaturase is an enzyme that promotes the formation of carbon- 
carbon double bonds in a hydrocarbon molecule. 

Desaturase activity may t>e demonstrated by assays in which a preparation 

25 containing an enzyme is incubated with a suitable form of substrate fatty acid and analyzed 
for conversion of this substrate to the predicted fatty acid product. Alternatively, a DNA 
sequence proposed to encode a desaturase protein may be incorporated into a suitable 
vector construct and thereby expressed in cells of a type that do not normally have an ability 
to desaturate a particular fatty acid substrate. Activity of the desaturase enzyme encoded by 

30 the DNA sequence can then be demonstrated by supplying a suitable form of substrate fatty 
acid to ceils transformed with a vector containing the desaturase-encoding DNA sequence 
and to suitable control cells (for example, transformed with the empty vector alone). In such 
an experiment, detection of the predicted fatty acid product in ceils containing the 
desaturase-encoding DNA sequence and not in control cells establishes the desaturase 

35 activity. Examples of this type of assay have been described in, for example, Lee et al., 

Science 280:915-918. 1998; Napier et al., Biochem. J. 330:611-614, 1998; and Michaelson 
et aL. J. Biol. Chem. 273:19055-19059. 1998, which are incorporated herein by reference. 
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The -desaturase activity may be assayed by these techniques using, for example. 
20:3A°"^^''"* as substrate and detecting 20:4A^®'^^'^'* as the product (Michaelson et al., J. BioL 
Chem, 273:19005-19059, 1998). Other potential substrates for use in A^ activity assays 
include (but are not limited to) 10:2A^^-^'* (yielding 20:5A^*^'-^^ as the product) and 
5 20:3A'' ''' ''^ (yielding 20:4A^'^^'^^*^^ as the product). 

The A® -desaturase may be assayed by similar techniques using, for example, 
20:3A^^'^'* ''^ as the substrate and detecting 20:4A^'^^'^'^*^^ as the product. 

ORF: Open reading frame. An ORF is a contiguous series of nucleotide triplets 
1 0 coding for amino acids. These sequences are usually translatable into a peptide. 

Homologs: Two nucleotide or amino acid sequences that share a common 
ancestral sequence and diverged when a species carrying that ancestral sequence split into 
two species. Homologs frequently show a substantial degree of sequence identity. 

15 

Transformed: A transfomned cell is a cell into which has been introduced a nucleic 
acid molecule by molecular biology techniques. The term encompasses all techniques by 

whicha-nucleic-acid-moiecuie-might-be-introduced-into-such-a-Gell-including-transfection 

with viral vectors, transformation with plasmid vectors, and introduction of naked DNA by 

20 electroporation, lipofection, and particle gun acceleration. 

Purified: The term purified does not require absolute purity; rather, it is intended as 
a relative term. Thus, for example, a purified protein preparation is one in which the subject 
protein or other substance is more pure than in its natural environment within a cell. 
25 Generally, a protein preparation is purified such that the protein represents at least 50% of 
the total protein content of the preparation. 

Operably linked: A first nucleic acid sequence is operably linked with a second 
nucleic acid sequence when the first nucleic acid sequence is placed in a functional 
30 relationship with the second nucleic acid sequence. For instance, a promoter is operably 
linked to a coding sequence if the promoter affects the transcription or expression of the 
coding sequence. Generally, operably linked DNA sequences are contiguous and, where 
necessary to join two protein coding regions, in the same reading frame. If introns are 
present, the operably linked DNA sequences may not be contiguous. 

35 

Cell: A plant, animal, protist. bacterial, or fungal cell. 
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Sequenc similarity: The similarity between two nucleic acids or two amino acid 
sequences is expressed in terms of percentage sequence identity. The higher the 
percentage sequence identity between two sequences, the more similar the two sequences 
are. 

5 In the case of protein alignments, similarity is measured not only in terms of 

percentage Identity, but also takes into account conservative amino acid substitutions. Such 
conservative substitutions generally preserve the hydrophobicity and acidity of the 
substituted residue, thus preserving the structure (and therefore function) of the folded 
protein. The computer programs used to calculate protein similarity employ standardized 

10 algorithms that, when used with standardized settings, allow the meaningful comparison of 
similarities between different pairs of proteins. 

Sequences are aligned, with allowances for gaps in alignment, and regions of 
identity are quantified using a computerized algorithm. Default parameters of the computer 
program are commonly used to set gap allowances and other variables. 

1 5 Methods of alignment of sequences for comparison are well-known in the art. 

Various programs and alignment algorithms are described by Pearson et ah, Methods in 
Molecular Biology 24: 307-331, 1994, and In Altschul et al.. Nature Genet 6:119-129, 1994. 

Altschul et al. presents a detailed conside ration of sequence alignment methods and 

homology calculations. 

20 The NCBI Basic Local Alignment Search Tool (BLAST"^) (Altschul et al., J. Mol, 

BioL 215:403-410, 1990 is available from several sources, including the National Center for 
Biotechnology Information (NBCI, Bethesda, MD) and on the Internet, for use in connection 
with the sequence analysis programs blastp, blastn. blastx. tbiastn and tblastx. BLAST™ 
can be accessed at htD:/ /www.ncbi.nlm.nih.Qov/BLAST/ . A description of how to determine 

25 sequence identity using this program is available at the web site. As used herein, sequence 
Identity is commonly determined with the BLAST^**^ software set to default parameters. For 
instance, blastn (version 2.0) software may be used to determine sequence identity between 
two nucleic acid sequences using default parameters (expect =10, matrix = BLOSUM62, 
filter = DUST (Tatusov and Lipmann, in preparation as of December 1, 1999; and Hancock 

30 and Armstrong, Comput. Appl. Biosci. 10:67-70, 1994), gap existence cost =11. per residue 
gap cost = 1, and lambda ratio = 0.85). For comparison of two polypeptides, blastp (version 
2.0) software may be used with default parameters (expect 10, filter = SEG (Wootton and 
Federhen, Computers in Chemistry 17:149-163, 1993), matrix = BLOSUM62. gap existence 
cost =11. per residue gap cost = 1, lambda = 0.85). 

35 When aligning short peptides (fewer than around 30 amino acids), the alignment 

should be performed using the Blast 2 sequences function, employing the PAM30 matrix set 
to default parameters (open gap 9, extension gap 1 penalties). 
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An alternative alignment tool is the ALIGN Global Optimal Alignment tool (version 
3.0) available from Biology Workbench at http://biology.ncsa.uiuc,edu. This tool may be 
used with settings set to default parameters to align two known sequences. References for 
this tool include Meyers and Miller, Cy4e/OS 4: 11 -17, 1989. 

5 

Conservative amino acid substitutions are those substitutions that, when made, 
least interfere with the properties of the original protein, i.e., the structure and especially the 
function of the protein is conserved and not significantly changed by such substitutions. The 
table below shows amino acids that may be substituted for an original amino acid in a 
1 0 protein and that are regarded as conservative substitutions. 

TABLE 1 



Original Residue 


Conservative Substitutions 


ala 


ser 


arg 


lys 


asn 


gin; his 


asp 


glu 


cys 


ser 


gin 


asn 


glu 


asp 


gly 


pro 


his 


asn; gin 


ile 


leu; val 


leu 


ile; val 


lys 


arg; gin; glu 


met 


teu; ile 


phe 


met; leu; tyr 


ser 


thr 


thr 


ser 


trp 


tyr 


tyr 


trp; phe 


vat 


ile; leu 



Conservative substitutions generally maintain (a) the structure of the polypeptide 
backbone in the area of the substitution, for example, as a sheet or helical conformation, 
1 5 (b) the charge or hydrophobicity of the molecule at the target site, or (c) the bulk of the side 
chain. 
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The substitutions which in general are expected to produce the greatest changes in 
protein properties will be non-consen/ative, for instance changes in which (a) a hydrophilic 
residue, e.g.. seryl or threonyl. is substituted for (or by) a hydrophobic residue, e.g., leucyi, 
isoleucyl, phenylalanyl. valyl or alanyl; (b) a cysteine or proline is substituted for (or by) any 
5 other residue; (c) a residue having an electropositive side chain, e.g.. lysyl, arginyl. or 

histadyl, is substituted for (or by) an electronegative residue, e.g., glutamyl or aspartyl; or 
(d) a residue having a bulky side chain, e.g., phenylalanine, is substituted for (or by) one not 
having a side chain, e.g., glycine. 

1 0 Probe: An isolated nucleic acid attached to a detectable label or reporter molecule. 

Typical labels include radioactive isotopes, ligands, chemiluminescent agents, and enzymes. 

Primers: Short nucleic acids, preferably DNA oligonucleotides 10 nucleotides or 
more in length, that are annealable to a complementary target DNA strand by nucleic acid 

1 5 hybridization to form a hybrid between the primer and the target DNA strand, then 

extendable along the target DNA strand by a DNA polymerase enzyme. Primer pairs can be 
used for amplification of a nucleic acid sequence, e.g., by the polymerase chain reaction 
(PGR) or other nucleic-acid amplification methods known in the art. 

Probes and primers as used in the present invention typically comprise at least 15 

20 contiguous nucleotides. In order to enhance specificity, longer probes and primers may also 
be employed, such as probes and primers that comprise at least 20, 30, 40, 50, 60, 70, 80, 
90, 100, or 150 consecutive nucleotides of the disclosed nucleic acid sequences. 

Alternatively, such probes and primers may comprise at least 15, 20. 30, 40, 50, 60, 
70, 80, 90, 100, or 150 consecutive nucleotides that share a defined level of sequence 

25 identity with one of the disclosed sequences, for instance, at least a 50%. 60%, 70%, 80%, 
90%, or 95% sequence identity. 

Alternatively, such probes and primers may be nucleotide molecules that hybridize 
under specific conditions and remain hybridized under specific wash conditions such as 
those provided below. These conditions can be used to identifying variants of the 

30 desaturases. Nucleic acid molecules that are derived from the desaturase cDNA and gene 
sequences include molecules that hybridize under various conditions to the disclosed 
desaturase nucleic acid molecules, or fragments thereof. Generally, hybridization conditions 
are classified into categories, for example very high stringency, high stringency, and low 
stringency. The conditions for probes that are about 600 base pairs or more in length are 

35 provided below in three corresponding categories. 
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Very High Stringency (detects sequences that share 90% sequence identity) 



Hybridization 


in 


ssc 


at 


65°C 


16 hours 


Wash tNvice 


in 


ssc 


at 


room temp. 


15 minutes each 


Wash twice 


in 


ssc 


at 


65°C 


20 minutes each 



High Stringency (detects sequences that share 80% sequence identity or greater) 



Hybridization 


in 


SSC 


at 


65**C- 
70^C 


16-20 hours 


Wash twice 


in 


ssc 


at 


room 
temp. 


5-20 minutes each 


Wash twice 


in 


ssc 


at 


55°C- 
70**C 


30 minutes each 



5 Low Stringency (detects sequences that share greater than 50% sequence Identity) 





Hybridization 


in 


SSC 


at 


room temp.- 
55X 


16-20 hours 




Wash at least 


in 


SSC 


at 


room temp.- 


20-30 minutes 




twice 








55**C 


each 





Methods for preparing and using probes and primers are described in the 
references, for example Sambrook et ai. Molecular Cloning: A Laboratory Manual, 2""* ed.. 
Cold Spring Harbor Laboratory Press, NY, 1989; Ausubel et al. Current Protocols in 
Molecular Biology, Greene Publishing Associates and Wiley-lntersciences, 1987; and Innis 
et al.. PCR Protocols, A Guide to Methods and Applications, Academic Press, Inc., San 
Diego, Califomia. 1990. PCR primer pairs can be derived from a known sequence, for 
example, by using computer programs intended for that purpose such as Primer (Version 
0.5, 1991. Whitehead Institute for Biomedical Research, Cambridge, MA). 

Recombinant nucleic acid: A sequence that is not naturally occurring or has a 
sequence that is made by an artificial combination of two otherwise separated segments of 
sequence. This artificial combination is often accomplished by chemical synthesis or, more 
commonly, by the artificial manipulation of isolated segments of nucleic acids, e.g., by 
genetic engineering techniques such as those described in Sambrook et al. Molecular 
Cloning: A Laboratory Manual, 2"^ ed,, Cold Spring Harbor Laboratory Press, NY. 1989. The 
term recombinant includes nucleic acids that have been altered solely by addition, 
substitution, or deletion of a portion of the nucleic acid. 



10 



15 



20 
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Native: The term "native" refers to a naturally-occurring ("wild-type") nucleic acid or 
polypeptide. The native nucleic acid or protein may have been physically derived from a 
particular organism in which it is naturally occurring or may be a synthetically constructed 
nucleic acid or protein that is identical to the naturally-occurring nucleic acid or protein. 

5 

Isolated: An "isolated" nucleic acid is one that has been substantially separated or 
purified away from other nucleic acid sequences in the cell of the organism in which the 
nucleic acid naturally occurs, i.e., other chromosomal and extrachromosomal DNA and RNA, 
by conventional nucleic acid-purification methods. The term also embraces recombinant 
1 0 nucleic acids and chemically synthesized nucleic acids. 

Plant: The term "plant" encompasses any higher plant and progeny thereof, 
including monocots {e.g., corn, rice, wheat, bariey. rapeseed, soy, sunflower, etc), dicots 
(e.g., potato, tomato, etc), and includes parts of plants, including seeds, fruit, tubers, etc 
1 5 The invention will be better understood by reference to the Examples herein. The 

scope of the invention is not to be considered limited thereto. 

Description And General Methods Of The Disclosure 

The present invention utilizes standard laboratory practices for the cloning, 

20 manipulation, and sequencing of nucleic acids, the purification and analysis of proteins, and 
other molecular biological and biochemical techniques, unless otherwise stipulated. Such 
techniques are explained in detail in standard laboratory manuals such as Sambrook et al., 
Molecular Cloning: A Laboratory Manual, 2"^ ed., Cold Spring Harbor Laboratory Press, NY, 
1989; and Ausubel et al.. Current Protocols In Molecular Biology, Green and Wiley- 

25 Interscience, NY, 1987. 

The inventors have identified, cloned, and expressed a novel fatty acid A® - 
desaturase from the protist Euglena gracilis and a novel fatty acid -desaturase from 
Caenorhabditis elegans that may be used together to produce polyunsaturated fatty acids. 
The invention provides novel purified A^ and A® proteins (FIG. 6A and FIG. 7A. 

30 respectively). The invention also provides proteins differing from the proteins of FIG. 6A and 
FIG. 7A by one or more conservative amino acid substitutions, as well as proteins that show 
"substantial similarit/' with the proteins of FIG. 6A and FIG. 7A. Substantial similarity is 
defined in the "Definitions" section. Proteins of the invention include proteins that show at 
least 50% amino acid similarity with the proteins shown in FIG. 6A and FIG. 7A. The term 

35 "50% amino acid similarity" is objectively and consistently defined by use of blastp sequence 
analysis software set at default parameters. Proteins of the invention also include proteins 
showing at least 60%, at least 70%. at least 80%, at least 90%, and at least 95% similarity 
(to the sequences of FIG. 6A or FIG. 7A) using blastp with default parameters. 
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The invention provides isolated novel nucleic acids that encode the above- 
mentioned proteins, recombinant nucleic acids that include such nucleic acids and cells 
containing such recombinant nucleic acids. Nucleic acids of the invention thus include 
nucleic acids that encode: (1) amino acid sequences as shown in FIG. 6A and FIG. 7A; (2) 
5 amino acid sequences that differ from the sequences shown in FIG. 6A and FIG. 7A by one 
or more conservative amino acid substitutions; and (3) amino acid sequences that show at 
least 50% similarity (as measured by blastp at default parameters) with the sequence of FIG. 
6A and FIG. 7A. 

Nucleic acids of the invention also include nucleic acids that show at least the term 

10 "50% similarity" with the nucleic acids shown in FIG. 6B and FIG. 7B. The term "50% 
similarity" is objectively defined by the use of blastn software set at default perimeters. 
Nucleic acids of the invention also include nucleic acids showing at least 60%, at least 70%, 
at least 80%. at least 90%, and at least 95% similarity (to the sequences of FIG. 6B and 
FIG.7B) using blastn with default perimeters. 

1 5 The novel - and A® -desaturase enzymes can be used individually, or in 

conjunction with one another, for instance in a metabolic pathway, to produce 
polyunsaturated fatty acids, such as 20:3 and 20:4 fatty acids. FIG. 1 B shows an example of 
such a metabolic pathway. Such a pathway may be engineered into any cell by use of 
appropriate expression systems. A simple way to provide such elements is by the use of 

20 commercially available expression systems, discussed in detail below. 

The scope of the invention covers not only entire nucleic acids encoding the novel 
A^ - and A^ desaturase enzymes (and substantially similar derivatives of such enzymes) but 
also covers "portions" of such nucleic acids (as defined in the "Definitions" section, herein). 
Such claimed portions are identified by their possession of a particular degree of similarity 

25 with similar sized portions of the nucleotides of FIG. 6B and FIG. 7B and may have a length 
of about 15, 20, 30, 40, or 50 contiguous nucleotides. Similarity is objectively measured by 
sequence comparison software, such as the "blastn" and "blastp" software available from the 
National Center for Biotechnology Information (NBCI, Bethesda, MD) and on the Internet at 
htD:/ /www.ncbi.nlm.nih.Qov/BLASTA Similarity between portions of nucleic acids claimed 

30 and similar sized portions of the nucleic acid sequences of FIG. 6B and FIG. 7B may be at 

least 50%, 60%, 70%, 80%, 90%, 95%, or even 98%. Such portions of nucleic acids may be 
used, for instance, as primers and probes for research and diagnostic purposes. Portions of 
nucleic acids may be selected from any area of the sequences shown in FIG. 6B or FIG. 7B, 
for instance the first, second, third, etc., group of 100 nucleic acids as numbered in the 

35 figures. 

Recombinant nucleic acids, as mentioned above, may, for instance, contain all or 
portion of a disclosed nucleic acid operably linked to another nucleic acid element such as a 
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promoter, for instance, as part of a clone designed to express a protein. Cloning and 
expression systems are commercially available for such purposes. 

Various yeast strains and yeast-derived vectors are commonly used for expressing 
and purifying proteins, for example. Pichia pastoris expression systems are available from 
5 Invitrogen (Carlsbad. CA). Such systems include suitable Pichia pastoris strains, vectors, 
reagents, sequencing primers, and media. A similar system for expression of proteins in 
Saccharomyces cerevisiae is also available from Invitrogen. which includes vectors, 
reagents and media. For example, a nucleotide sequence (e.g.. a gene coding for the - 
or A® -desaturase enzyme of the invention) may be cloned into the yeast expression vector 

1 0 pYES2 and expressed under the control of an inducible promoter, such as a galactose- 
inducible promoter (GAL1). 

Non-yeast eukaryotic vectors may also be used for expression of the desaturases of 
the invention. Examples of such systems are the well known Baculovirus system, the 
Ecdysone-inducible mammalian expression system that uses regulatory elements from 

1 5 Drosophila melanogaster to allow control of gene expression, and the Sindbis viral 

expression system that allows high level expression in a variety of mammalian cell lines. 
These expression systems are also available from Invitrogen. 

Standard prokaryotic cloning vectors may also be used, for example pBR322, 
pUCIQ or pL/C19 as described in Sambrook et al, 1989. Nucleic acids encoding the 

20 desaturases of the invention may be cloned Into such vectors that may then be transformed 
into bacteria such as Escherischia coli (E. coli) which may then be cultured so as to express 
the protein of interest. Other prokaryotic expression systems include, for instance, the 
arabinose-induced pBAD expression system that allows tightly controlled regulation of 
expression, the IPTG-induced pRSET system that facilitates rapid purification of 

25 recombinant proteins and the IPTG-induced pSE402 system that has been constructed for 
optimal translation of eukaryotic genes. These three systems are available commercially 
from Invitrogen and, when used according to the manufacturer's instructions, allow routine 
expression and purification of proteins. 

Alternatively, and of particular importance to this invention, a plant expression 

30 system could be used. Riant expression systems are commercially available. A gene of 
interest of the invention may be cloned into a vector and the construct used to transform a 
plant cell. Any well known vector suitable for stable transfomriation of plant cells and/or for 
the establishment of transgenic plants may be used, including those described in, e.g., 
Pouwels et aL. Cloning Vectors: A Laboratory Manual, 1985, supp. 1987; Weissbach and 

35 Weissbach, Methods for Plant Molecular Biology, Academic Press. 1 989; and Gelvin et al. , 
Plant Molecular Biology Manual, Kluwer Academic Publishers, 1990. Such plant expression 
vectors can include expression control sequences (e.g., inducible or constitutive. 
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environmentally or deveiopmentally regulated, or cell- or tissue-specific expression-control 
sequences). 

Examples of constitutive plant promoters useful for expressing desaturase enzymes 
in plants include, but are not limited to, the cauliflower mosaic virus (CaMV) 35S promoter 
5 (see, e.g., Ode! et aL, Nature 313:810. 1985; Dekeyser et aL. Plant Cell 2:591 . 1990; and 
Terada and Shimamoto, MoL Gen. Genet 220:389, 1990); the nopaline synthase promoter 
(An et al.. Plant Physiol. 88:547, 1 988) and the octopine synthase promoter (Fromm et a!., 
P/anfCe// 1:977, 1989). 

A variety of piant-gene promoters that are regulated in response to environmental, 

1 0 hormonal, chemical, and/or developmental signals, also can be used for protein expression 
in plant cells, including promoters regulated by (1) heat (Callis et aL, Plant Physiol. 88:965, 
1988), (2) light {e.g., pea rbcS-3A promoter, Kuhlemeier et aL. Plant Ce// 1:471, 1989; maize 
rbcS promoter, Schaffner and Sheen, Plant Cell 3:997, 1991; or chlorophyll a/b-binding 
protein promoter, Simpson et aL, EMBO J. 4:2723, 1985), (3) hormones, such as abscisic 

15 acid (Marcotte et aL, Plant Cell 1:969, 1989), (4) wounding (e.g., wuni, Siebertz et al.. Plant 
Ce// 1:961 , 1989); or (5) chemicals such as methyl jasmonate, salicylic acid, or a safener. It 
may also be advantageous to employ (6) organ-specific promoters (e.g., Roshal et aL, 

EMBO J. 6:1155 . 1987 : Schemthaner et aL. EMBO J. 7:1249, 1988; BustosetaL, Plant Cell 

1:839. 1989; Zheng et aL, Plant J. 4:357-366, 1993). Tissue-specific expression may be 

20 facilitated by use of certain types of promoters, for example, the napin promoter is a seed- 
storage protein promoter from Brassica and specific to developing seeds. The p-conglycinin 
promoters drive the expression of recombinant nucleic acids thus allowing, the or A® 
proteins of the invention to be expressed only in specific tissues, for example, seed tissues. 
Plant expression vectors can include regulatory sequences from the 3'-untranslated 

25 region of plant genes (Thornburg et aL, Proc. Natl. Acad. Sci. USA 84:744, 1987; An et aL, 
Plant Cell 1:115, 1 989). e.g., a 3' terminator region to increase mRNA stability of the mRNA, 
such as the PI-II terminator region of potato or the octopine or nopaline synthase 3* 
terminator regions. 

Useful dominant selectable marker genes for expression in plant cells include, but 
30 are not limited to: genes encoding antibiotic-resistance genes (e.g., resistance to 

hyg'romycin, kanamycin, bleomycin, G418, streptomycin, or spectinomycin); and herbicide- 
resistance genes (e.g., phosphinothricin acetyltransferase). Useful screenable markers 
include, but are not limited to. p-glucuronidase and green fluorescent protein. 

The invention also provides cells or plants or organisms transformed with 
35 recombinant nucleic acid constructs that include all or a portion- of the newly discovered 
polynucleotides that encode the novel A^ and/or A° desaturase enzymes. An example of 
such a transformed plant or organism would be a potato, tomato, rapeseed, sunflower, soy, 
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wheat, or com plant. Multi-celled fungi, such as edible mushrooms, may also be 
transformed. Transfomned oil-seed plants are of particular interest as 20-carbon 
polyunsaturated fatty acids would accumulate within the seed-oil. 

Nucleic acid constructs that express a nucleic acid according to the invention can be 
5 introduced into a variety of host cells or organisms in order to alter fatty acid biosynthesis. 
Higher plant cells, eukaryotic, and prokaryotic host cells all may be so transformed using an 
appropriate expression system as described above. 

After a cDNA (or gene) encoding a desaturase has been isolated, standard 
techniques may be used to express the cDNA in transgenic plants in order to modify the 

1 0 particular plant characteristic. The basic approach is to clone the cDNA Into a 

transformation vector, such that the cDNA is operably linked to control sequences (e.g.. a 
promoter) directing expression of the cDNA in plant cells. The transformation vector is then 
introduced into plant cells by any of various techniques, for example by Agrobacterium- 
mediated transformation of plants or plant tissues, or by electroporation of protoplasts, and 

1 5 progeny plants containing the introduced cDNA are selected. All or part of the 

transformation vector stably integrates into the genome of the plant cell. That part of the 
transformation vector that integrates into the plant cell and that contains the introduced 
cDNA and associated sequences for controlling expression (the introduced "transgene") may 
beTefefredto as tH^"recombifTant expressiofrcassette:" 

20 Selection of progeny plants containing the introduced transgene may be made 

based upon the detection of an altered phenotype. Such a phenotype may result directly 
from the cDNA cloned into the transformation vector or may be manifested as enhanced 
resistance to a chemical agent (such as an antibiotic) as a result of the inclusion of a 
dominant selectable marker gene incorporated into the transformation vector. 

25 Successful examples of the modification of plant characteristics by transformation 

with cloned cDNA sequences are replete in the technical and scientific literature. Selected 
examples, which serve to illustrate the knowledge in this field of technology include: 
U.S. Patent No. 5,571,706 ("Plant Virus Resistance Gene and Methods") 
U.S. Patent No. 5,677,175 ("Plant Pathogen Induced Proteins") 

30 U.S. Patent No. 5,51 0,471 ("Chimeric Gene for the Transfomnation of Plants") 
U.S. Patent No. 5,750.386 ("Pathogen-Resistant Transgenic Plants") 
U.S. Patent No. 5,597,945 ("Plants Genetically Enhanced for Disease Resistance") 
U.S. Patent No. 5,589,615 ("Process for the Production of Transgenic Plants with Increased 
Nutritional Value Via the Expression of Modified 2S Storage Albumins") 

35 U.S. Patent No. 5,750,871 ("Transformation and Foreign Gene Expression in Brassica 
Species") 

U.S. Patent No. 5,268,526 ("Overexpression of Phytochrome in Transgenic Plants") 
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U.S. Patent No. 5.262,316 ("Genetically Transformed Pepper Plants and Methods for their 
Production") 

U.S. Patent No. 5»569.831 ("Transgenic Tonnato Plants with Altered Polygalacturonase 
Isoforms") 

5 These examples include descriptions of transformation vector selection. 

transformation techniques, and the construction of constructs designed to over-express the 
introduced cDNA. In light of the foregoing and the provision herein of the desaturase amino 
acid sequences and nucleic acid sequences, it is thus apparent that one of skill in the art will 
be able to introduce the cDNAs, or homologous or derivative forms of these molecules. Into 

1 0 plants in order to produce plants having enhanced desaturase activity. Furthermore, the 
expression of one or more desaturases in plants may give rise to plants having increased 
production of poly-unsaturated fatty acids. 

The invention also pertains to antibodies to the desaturase enzymes, and fragments 
thereof, these antibodies may be useful for purifying and detecting the desaturases. The 

1 5 provision of the desaturase sequences allows for the production of specific antibody-based 
binding agents to these enzymes. 

Monoclonal or polyclonal antibodies may be produced to the desaturases, portions 
of the desaturases, or variants thereof. Optimally, antibodies raised against epitopes on 
these antigenswil! specificallycJetectlhe enzyme. TMtisTantibodiesraised against the 

20 desaturases would recognize and bind the desaturases, and would not substantially 

recognize or bind to other proteins. The determination that an antibody specifically binds to 
an antigen is made by any one of a number of standard immunoassay methods; for 
instance, Western blotting , Sambrook et al. (ed.), Molecular Cloning: A Laboratory Manual, 
2nd ed-: Vols. 1-3, Cold Spring Harbor Laboratory Press, Cold Spring Harbor. NY, 1989. 

25 To determine that a given antibody preparation (such as a preparation produced in a 

mouse against the -desaturase) specifically detects the desaturase by Western blotting, 
total cellular protein is extracted from cells and electrophoresed on a SDS-polyacrylamide 
gel. The proteins are then transferred to a membrane (for example, nitrocellulose) by 
Western blotting, and the antibody preparation is incubated with the membrane. After 

30 washing the membrane to remove non-specifically bound antibodies, the presence of 

specifically bound antibodies is detected by the use of an anti-mouse antibody conjugated to 
an enzyme such as alkaline phosphatase; application of 5-bromo-4-chloro-3-indolyl 
phosphate/nitro blue tetrazolium results in the production of a densely blue-colored 
compound by immuno-localized alkaline phosphatase. 

35 Antibodies that specifically detect a desaturase will, by this technique, be shown to 

bind substantially only the desaturase band (having a position on the gel determined by the 
molecular weight of the desaturase). Non-specific binding of the antibody to other proteins 
may occur and may be detectable as a weaker signal on the Westem blot (which can be 
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quantified by automated radiography). The non-specific nature of this binding will be 
recognized by one skilled in the art by the weak signal obtained on the Western blot relative 
to the strong primary signal arising from the specific anti-desaturase binding. 
Antibodies that specifically bind to desaturases belong to a class of molecules that are 
5 referred to herein as "specific binding agents." Specific binding agents that are capable of 
specifically binding to the desaturase of the present invention may include polyclonal 
antibodies, monoclonal antibodies, and fragments of monoclonal antibodies such as Fab, 
F(ab')2, and Fv fragments, as well as any other agent capable of specifically binding to one 
or more epitopes on the proteins. 
1 0 Substantially pure desaturase suitable for use as an immunogen can be isolated 

from transfected cells, transformed cells, or from wild-type cells. Concentration of protein in 
the final preparation is adjusted, for example, by concentration on an Amlcon filter device, to 
the level of a few micrograms per milliliter. Alternatively, peptide fragments of a desaturase 
may be utilized as immunogens. Such fragments may be chemically synthesized using 
1 5 standard methods, or may be obtained by cleavage of the whole desaturase enzyme 

followed by purification of the desired peptide fragments. Peptides as short as three or four 
amino acids in length are immunogenic when presented to an immune system in the context 
of a Major Histocompatibility complex (MHC) molecule, such as MHC class I or MHC class 
II. Accordingly, peptides comprising at least 3 and preferably at least 4, 5, 6, or more 
consecutive amino acids of the disclosed desaturase amino acid sequences may be 
employed as immunogens for producing antibodies. 

Because naturally occurring epitopes on proteins frequently comprise amino acid 
residues that are not adjacently arranged in the peptide when the peptide sequence is 
viewed as a linear molecule, it may be advantageous to utilize longer peptide fragments from 
the desaturase amino acid sequences for producing antibodies. Thus, for example, 
peptides that comprise at least 10. 15, 20, 25, or 30 consecutive amino acid residues of the 
amino acid sequence may be employed. Monoclonal or polyclonal antibodies to the intact 
desaturase, or peptide fragments thereof may be prepared as described below. 

Monoclonal antibody to any of various epitopes of the desaturase enzymes that are 
identified and isolated as described herein can be prepared from murine hybridomas 
according to the classic method of Kohler & Milstein, Nature 256:495, 1975, or a derivative 
method thereof. Briefly, a mouse is repetitively inoculated with a few micrograms of the 
selected protein over a period of a few weeks. The mouse is then sacrificed, and the 
antibody-producing cells of the spleen isolated. The spleen cells are fused by means of 
polyethylene glycol with mouse myeloma cells, and the excess unfused cells destroyed by 
growth of the system on selective media comprising aminopterin (HAT media). The 
successfully fijsed cells are diluted and aliquots of the dilution placed in wells of a microtiter 
plate where growth of the culture is continued. Antibody-producing clones are identified by 
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detection of antibody in the supernatant fluid of the wells by innmunoassay procedures, such 
as ELISA. as originally described by Engvall. Enzymol. 70:419, 1980, or a derivative method 
thereof. Selected positive clones can be expanded and their monoclonal antibody product 
harvested for use. Detailed procedures for monoclonal antibody production are described in 
5 Harlow & Lane, Antibodies, A Laboratory Manual, Cold Spring Harbor Laboratory, New York, 
1988. 

Polyclonal antiserum containing antibodies to heterogenous epitopes of a single 
protein can be prepared by immunizing suitable animals with the expressed protein, which 
can be unmodified or modified, to enhance immunogenicity. Effective polyclonal antibody 
1 0 production is affected by many factors related both to the antigen and the host species. For 
example, small molecules tend to be less immunogenic than other molecules and may 
require the use of carriers and an adjuvant. Also, host animals vary in response to site of 
inoculations and dose, with both inadequate or excessive doses of antigen resulting in low- 
titer antisera. Small doses (ng level) of antigen administered at multiple intradermal sites 
1 5 appear to be most reliable. An effective immunization protocol for rabbits can be found in 
Vaitukaitis et al., J. Clin. Endocrinol Metab. 33:988-991, 1971. 

Booster injections can be given at regular intervals, and antiserum harvested when 
the antibody titer thereof, as determined semi-quantitatively, for example, by double 
immunodiffusion in agar against known concentrations of the antigen, begins to fall. See, for 
example, Ouchterlony et al., Handbook of Experimental Immunology, Wier, D. (ed.), Chapter 
19, Biackwell. 1973. A plateau concentration of antibody is usually in the range of 0.1 to 0.2 
mg/mL of serum (about 12 nM). Affinity of the antisera for the antigen is determined by 
preparing competitive binding curves using conventional methods. 

Antibodies may be raised against the desaturases of the present invention by 
subcutaneous injection of a DNA vector that expresses the enzymes in laboratory animals, 
such as mice. Delivery of the recombinant vector into the animals may be achieved using a 
hand-held form of the Biolistic system (Sanford et al.. Particulate Sci. TechnoL 5:27-37, 
1987, as described by Tang et al., Nature (London) 356:153-154, 1992). Expression vectors 
suitable for this purpose may include those that express the cDNA of the enzyme under the 
transcriptional control of either the human p-actin promoter or the cytomegalovirus (CMV) 
promoter. Methods of administering naked DNA to animals in a manner resulting in 
expression of the DNA in the body of the animal are well known and are described, for 
example, in U.S. Patent Nos. 5,620,896 f DNA Vaccines Against Rotavirus Infections'*); 
5,643,578 ("Immunization by Inoculation of DNA Transcription Unit'); and 5,593,972 
("Genetic Immunization"), and references cited therein. 

Antibody fragments may be used in place of whole antibodies and may be readily 
expressed in prokaryotic host cells. Methods of making and using immunologically effective 
portions of monoclonal antibodies, also referred to as "antibody fragments," are well known 
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and include those described in Better & Horowitz, Methods EnzymoL 178:476-496. 1989; 
Glockshuber et a!. B/oc/7em/sfry 29:1362-1367, 1990; and U.S. Patent Nos. 5.648,237 
("Expression of Functional Antibody Fragments"); No. 4,946.778 ("Single Polypeptide Chain 
Binding Molecules"); and No. 5,455,030 ("Inr^munotherapy Using Single Chain Polypeptide 
5 Binding Molecules"), and references cited therein. 

Experimental Examples 

Example 1: Organism Strains and Culture 

The strain Euglena gracilis Z was obtained from Columbia Scientific. The organism 
10 was cultured on Cramer and Meyers medium (Cramer, and Meyers, Archiv fur Mikrobiologie 
17:384-402, 1952) with the addition of sucrose as a carbon source. Cultures were 
maintained at 25°C in absolute darkness. 

C. elegans was obtained from Caenorhabditis Genetics Center, St. Paul. Minnesota, 
and grown under standard conditions (Sulston et al., The Nematode Caenorhabditis elegans 
15 (Wood, W. B., Eds.), pp. 587-606, Cold Spring Harbor Laboratory Press, Cold Spring 
Harbor, NY, 1988). 

Example 2: Database Searches for C. elegans Gene Homoioqs 

The Sanger Center (http://www.sanger.ac.uk/projects/c_elegans/blast_ser^er.shtml) 

20 C. elegans genomic database was searched using BLAST™ with sequences of plant 
desaturase enzymes, including the S. officinalis A® -desaturase (GenBank accession 
number U79010). Two C. elegans polypeptides with the highest scores were a peptide on 
cosmid W08D2 (high score 163), and one on T13F2 (high score 121). 

25 Example 3: RNA Isolation. Reverse Transcription PCR. and 
RACE (Rapid Amplification of cDNA Ends) 

For the E. gracilis A® gene, total RNA was isolated from heterotrophic cultures of E. 

gracilis using a phenol-SDS protocol (Ausubel, Current Protocols In Molecular Biology, 

1988). Messenger RNA was purified from total RNA using the PolyA-tract system (Promega 

30 Scientific, Madison, Wisconsin). Reverse transcription reactions were carried out using 
Superscript II (Life Technologies, Rockville, Maryland). First-strand synthesis in the initial 
reactions was primed using anchored polyT primers (Clontech, Palo Alto, California). 
Second-strand synthesis was conducted as described (Life Technologies), and polymerase 
chain reaction amplification of the core region of the gene was accomplished using the 

35 primers (GGCTGGCTGACNCAYGARTTYTGYCAY; SEQ. ID NO. 5) and 

(CATCGTTGGAAANARRTGRTGYTCDATYTG; SEQ. ID NO. 6), designed to be completely 
degenerate to sequences overlapping the first and third His-box regions of the A®- 
desaturase C. elegans gene. 
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The amplification protocol was developed using published guidelines for use of 
degenerate primers (Compton, PCR Protocols: A Guide To Methods And Applications, 
1990). The amplification consisted of 5 preliminary cycles at very low annealing temperature 
(30 seconds at 94<*C, 1 -minute ramp to 37**C, 45 seconds at ST^'C, 3-minute ramp to 72°C) 
5 followed by 30 cycles with higher temperature (30 seconds at 94°C. 1 -minute ramp to SO^'C, 
45 seconds at 50^*0, 3-minute ramp to 72*'C. Preliminary amplifications to optimize thermal 
cycling parameters used Pfu DNA polymerase (Stratagene. La Jolla, California). 
Amplification was successful at 3 mM magnesium and each primer at 4 pM. Subsequently 
Taq polymerase was used for amplification under identical conditions. 
1 0 Polymerase chain reaction products from 350 to 750 bp were isolated from agarose 

gels with commercial reagents (Qiagen, Valencia, California) and sequenced directly using 
the degenerate primers and dye-termination sequencing technology (Applied Biosystems, 
Foster City. California). A group of identical amplification products contained an open 
reading frame that was homologous to known desaturases when analyzed by BLAST™ 
1 5 search (Altschul et al., Nucleic Acids Res. 25:3389-3402, 1 997). The 5' and 3* sequences of 
the complete mRNA were obtained with the Marathon RACE system (Clontech), using pairs 
of nested primers designed to amplify from within the core sequence. To clone the complete 
5' end of the gene, it was necessary to repeat the reverse transcription with a primer specific 
to the sequence of the open reading frame and repeat the 5* RACE amplification. 

For the C. elegans A® gene, RNA isolation and reverse transcription-PCR were 
performed as follows. RT-PCR was used to amplify the coding sequences of the two 
putative desaturase genes. Total RNA from mixed stage C. elegans was used for the RT- 
PCR template. The nematodes were grown on agar plates as described and RNA was 
isolated using the phenol/SDS method (Sluder et al., Dev. BioL 184:303-319, 1997). RT- 
PCR was performed using the Superscript^ One-Step RT-PCR system (Gibco-BRL/Life 
Technologies). Approximately 1 iig of total RNA was added to a reaction mixture consisting 
0.2 mM of each dNTP, 1.2 mM MgS04. Superscript ll™ RT/Taq polymerase mix, and 200 
yM of appropriate downstream and upstream primers. The reactions were incubated at 
50X for 30 min., then subject to 35 cycles of PCR amplification. For the T13F2.1 gene (fat- 
4) a 5' primer corresponding to bases 34339-34361 of cosmid T13F2 was used. Smal, 
H/ndlll, and Xho\ restriction sites were added to these sequences to facilitate cloning. The 
resulting primer [CCCGGGAAGCTTCTCGAGGAATTTTCAATCCTCCTTGGGTC; SEQ. ID 
NO: 7] anneals to the cosmid T13F2 19-42 base pairs upstream of the putative start codon 
ATG of the fat-4 gene. To amplify the 3' end of the fdt-4 gene, a primer was used 
corresponding to the inverse complement of bases 37075-37095 of cosmid T13F2, with the 
addition of Smal and SamH1 sites to facilitate cloning the polynucleotide of interest: 
CCCGGGTGGATCCGGAACATATCACACGAAACAG; SEQ. ID NO. 8]. This primer begins 
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93 base pairs after the putative stop codon TAG and ends 20 base pairs upstream of the 
predicted polyadenylation signal (AAUAAA; SEQ ID NO: 9) of the fat-4 gene. 

For the determination of trans-splicing of specific leader sequences, downstream 
primers corresponding to the complement of bases 35009-35028 of the T13F2 
5 (TCTGGGATCTCTGGTTCTTG; SEQ. ID NO: 10) were used for the T13F2.1 gene. The 
upstream PGR primers were either SL1-20 or SI-2-20 (Spieth et al.. Ce// 73:521-532. 1993). 
The C. elegans homologue of the ribosomal-protein L37 was used as an SL1 -specific 
control, and K06H7.3 was used as an SL2-specific control (Zorio et a!., Nature 372:270-272, 
1994. SL1-20, SL2-20, and control primers were kindly provided by Diego A. R. Zorio. RT- 
1 0 PGR products visualized by gel electrophoresis were confirmed by blotting the gel and 
probing with gene-specific oligonucleotides corresponding to the appropriate gene as 
previously described (Spieth et al., Ce// 73:21 -532, 1993). 

Example 4: PGR Amplification of the Genes Encoding A^and A° 
15 Desatu rases 

DNA and protein sequences were analyzed using the Wisconsin-GCG package of 
programs (Devereux et al., Nucleic Acids Res. 12:387-95. 1984). 

To clone the E. gracilis (A®) open reading frame as a single DNA fragment, a set of 
primers was used to prime a reverse transcription specifically^foTthe open reading"frame. 
The primer for the 5' end of the gene began 3 nucleotides before the start codon and 
included the first 26 nucleotides of the open reading frame. The 3' primer was 
complementary to the sequence between 22 and 52 nucleotides downstream from the 
predicted termination codon. This PGR amplification was conducted with Pfu polymerase to 
minimize the chance of an amplification error. The PGR reactions produced a single band of 
the predicted size when analyzed by agarose gel electrophoresis. This band was cloned into 
the vector pCR-Script Cam'^" (Stratagene), and a single clone designated pJW541 was 
chosen for analysis. 

To express the C. elegans -desaturase, the fat-4 cDNA amplification product (see 
Example 3) was digested with HiridWl and Ba/nHI and ligated to the yeast expression vector 
pYES2 (Invitrogen) cut with HindlW and BamHI. The resulting plasmid was named pYFAT4. 

Example 5: Exoresslon of A^ and A^ -desaturases 

For E. gracilis, the cloned A® gene was transferred to the yeast expression vector 
pYES2 (Invitrogen, Carisbad, Galifornia) by standard cloning techniques (Ausubel, Current 
Protocols In Molecular Biology, 1988) using enzymes obtained from New England Biolabs. 
Beveriy, Massachusetts. The resulting yeast expression construct containing the open 
reading frame under the control of a galactose-inducible promoter was designated pYES2- 
541. 
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Saccharomyces cerevisiae strain INVSd (Invitrogen) was transformed with pYES2- 
541 and cultured using standard methods (Ausubel, Current Protocols In Molecular Biology, 
1988). Liquid medium containing 2% galactose was supplemented with fatty acid soaps 
(NuCheck Prep, Elysian MN) at a final concentration of 0.2 mM. Tergitol (1%, NP40) was 
5 added to the yeast cultures to enhance fatty acid uptake (Stukey et al., J. Biol. Chem. 
264:16537-16544. 1989), except for cultures containing 20:1. where 5% DMSO was 
substituted. Yeast were incubated overnight at 28*'C , harvested by centrifugation, washed 
once with 1% Tergitol, once with 0.5% Tergitol, and finally once with distilled water. 

For the C. elegans gene, the constructs were transformed into Saccharomyces 

10 cerevisiae strain INVScI using the S.c. EasyComp transformation kit (Invitrogen). For 
experiments with the FAT-4 peptide, transformed yeast were grown overnight in uracil- 
deficient media containing 2% galactose, 0.2 mM fatty acid, and 1% NP-40- Under these 
conditions the percentage of these supplemented fatty acids which were incorporated into 
yeast lipids ranged from 14-28% of the total yeast fatty acids. For experiments in which 

15 20:1 a'' was used as a substrate, the 1% NP-40 was replaced by 5% DMSO to achieve 
better incorporation of this fatty acid. 

Example-6:-Analvsis_of_Fattv_Acids Using Gas Chromatography and 

GC-Mass Soectrometrv 

20 Extraction of lipids and preparation of fatty acid methyl esters was carried out by 

standard methods (Miquel and Browse. J, Biol, Chem. 267:1502-1509, 1992). Gas 
chromatography of the methyl esters was conducted by established methods (Spychalla et 
al., Proc, Natl. Acad. ScL i;SA 94:1142-1147, 1997). Fatty acid 4,4-Dimethyloxazoline 
(DMOX) derivatives of yeast lipid extracts were prepared by standard methods (Fay and 

25 Richli. J. Chromatogr. 541:89-98,. 1991). GC-mass spectrometry was conducted on a 

Hewlett-Packard 6890 series GC-MS fitted with a 30m x 0.25 nm HP5MS column, operating 
at an ionization voltage of 70 eV with a scan range of 50-550 Da. Fatty acids and their 
derivatives were identified where possible by comparison with authentic standards (NuCheck 
Prep). 

30 

Example 7: Identification and Amplification of the Eualena A° 
-desaturase Gene 

Messenger RNA isolated from heterotrophic cultures was used as template for 
reverse transcription followed by PGR amplification using degenerate primers that spanned 
35 the first and third conserved histidine-rich regions of microsomal desaturase proteins. The 
C. elegans A^-desaturase gene, FAT-3, was used as the principal basis for primer design. 
To compensate for the high degeneracy necessary in the primer pair, amplification reactions 
began with five cycles of low-temperature annealing and a long temperature ramp between 
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the annealing and polynnerization steps. Preliminary amplifications to optimize thermal 
cycling parameters used the proofreading Pfu DNA polymerase. After successful Pfu 
amplification reactions using high primer and magnesium concentrations, Taq polymerase 
was used to generate a number of bands detectable on agarose gels. 
5 Several of these bands, of approximately 650 bp, had identical sequence. This DNA 

sequence contained an open reading frame in which the predicted amino acid sequence was 
homologous to other membrane desaturases, and included a characteristic central His-box. 
Primers designed to be specific to the amplified sequence were used to amplify the termini 
of the cDNA using 3' and 5' RACE techniques. The full-length cDNA for this gene was 1745 

10 bp in length. It included an open reading frame of 1272 bp and a 472-bp 3* untranslated 
region. Most Euglena messenger RNAs are processed through the addition of a short 5' 
RNA leader sequence, the trans-spliced leader (Tessier et ai., Embo. J. 10:2621-2625, 
1991). This RNA processing step left a conserved sequence (TTTTTTTCG; SEQ. ID NO. 
11) at the beginning of each message (Cui et a!., J, Biochem. (Tokyo) 115:98-107, 1994). 

1 5 The presence of this leader in the cDNA sequence confirmed that the message was full- 
length at the 5' end. RT-PCR with primers flanking the open reading frame on the 5' and 3* 
ends resulted in a single band that was cloned into the vector pCR-Script Cam"^ 
(Stratagene), and designated pJW541 . The gene con^esponding to this ORF was 
designated EFD1 {Euglena fatty acid desaturase 1). 

20 

Example 8: Similarity Between Eualena -desaturase and Other 
Proteins 

The translated open reading frame indicated a protein of 422 amino acids with a 
predicted molecular mass of 48.8 kDa. (FIG. 3). A BLAST™ search of sequence databases 

25 revealed that the predicted protein sequence exhibited regions of homology with the known 
group of membrane fatty acid desaturases, especially in the highly conserved histidine-rich 
regions (Shanklin et al., 6/oc/7e/n/sf/y 33:12787-12794, 1994). 

Each of the His-box motifs is present in the EFD1 protein. The first (HXXXH) starts 
at amino acid 146 and the second (HXXHH; SEQ ID NO: 12) at amino acid 183 (FIG. 3). 

30 EFD1 contains a variant third His-box, QXXHH (SEQ ID NO: 13), starting at amino acid 361, 
similar to the cloned A^- and A® -desaturases. EFD1 exhibits conservation of protein 
sequence in the regions surrounding the highly conserved regions, especially with FAT-3 
and FAT-4, the A®- and A^-desaturases of C. elegans (FIG. 3). Outside the highly 
conserved regions, the amino acid sequence shows considerably less similarity to other 

35 desaturases. Overall, the amino acid identity with FAT-3 and FAT-4 is 33%, compared to 
28% identity with the borage A®-desaturase. 

EFD1 also contains a cytochrome bs-like motif at its N-terminus. The protein 
encodes seven of the eight most highly conserved amino acids characteristic of cytochrome 
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bs (FIG. 3), which are responsible for heme binding. Similar motifs are found at the N- 
temilnal regions of FAT-3 and FAT-4 (FIG. 3). and the borage A® protein, as well as the 
carboxyl terminus of a yeast A® protein. 

The structure of the Euglena protein also exhibits similarities with known 
5 desaturases. Membrane desaturases are type II multiple membrane spanning proteins, and 
hydropathy analysis of the cloned Euglena gene indicates that the predicted protein has at 
least three significant hydrophobic regions long enough to span the membrane bilayer twice. 
As is true for most desaturase enzymes, there are 31 amino acid residues between the first 
two His-boxes. The distance between the between the second and third His-box is 173 
1 0 residues, within the range previously observed (Shanktin and Cahoon. Anna. Rev. Plant 
Physiol. Plant Moi Biol. 48:611-641. 1998). 

Example 9: Activity of the Euglena A^ -desaturase Protein 

To confirm the activity of the enzyme, the EFD1 cDNA was transferred from 

1 5 pJW541 to yeast expression vector pYES2 under the control of a galactose-inducible 

promoter. The resulting construct, pYES2-541, was introduced into S. cerevisiae. Yeast 
membranes do not contain 20-carbon fatty acids but incorporate them from the culture 
medium. Accordingly, yeast cultures were supplemented with various fatty acid soaps, using 
a yeast strain containing the empty vector as control, and analyzed the fatty acids of the 

20 cultures by methyl-ester derivatization and gas chromatography. 

The patterns of desaturation activity in these experiments indicated that pYES2-541 
expresses a A®-desaturase enzyme that does not have A^ or A^ activity. The ability of the 
experimental yeast strain to produce A® desaturation was shown when the culture medium 
was supplemented with 20:2 (FIG. 4). A desaturation peak whose retention time is identical 

25 to authentic 20:3 was produced. The vector-only control culture did not desaturate 20:2 
(FIG. 4). The yeast strain expressing the Euglena gene also desaturated 20:3 and 20:1 
(FIG. 4). again without desaturation activity in the control cultures. The cloned Euglena 
protein was most active with 20:3 and 20:2 as substrates, desaturating 70% and 73% of the 
total incorporated 20-carbon fatty acid. EFD1 was least active with 20:1 , converting 32% of 

30 that substrate to a desaturation product (FIG. 4). 

When the culture medium was supplemented with a substrate for A^-desaturation, 
20:4, the fatty acid was incorporated into the yeast, but no 20:5 desaturation product was 
produced. Similariy, when the medium was supplemented with 18:2 and 18:3. no 
desaturation occurred, demonstrating that the cloned gene did not express a A®-desaturase. 

35 To confimi that desaturation had occurred at the A® position, 4,4-dimethyloxazoline 

(DMOX) derivatives of yeast fatty acids were analyzed by GC-MS. DMOX derivatives have 
mass spectra that are more easily interpreted than spectra of methyl esters, and permit 
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unambiguous determination of double-bond locations in polyunsaturated fatty acids (Christie, 
Lipids 33:343-353. 1998). For the experiment that desaturated the fatty acid 20:2, the 
retention time of the product on the GC-MS instrument was 16.8 min., identical to DMOX- 
derivatized authentic 20:3. The mass spectrum of this desaturation product and its 
5 molecular ion (m/z 359) indicated that it was the 20:3 compound. Two spectral frequency 
peaks at m/z 182 and 194, separated by only 12 a.m.u., showed that the introduced double 
bond was at the A® position (FIG. 5). (The substrate 20:2, which is saturated at the 8- 
position, has peaks at 182 and 196, separated by 14 a.m.u.) The spectrum of the product, 
with its desaturation peaks, was identical to that of authentic 20:3 (Luthria and Sprecher, 
10 Lipids 28:561-564, 1993). The other substrates, 20:1 and 20:3, were also desaturated at the 
A®-position by EFD1 . The peaks at m/z 182 and m^ 194 appeared in the spectrum of each, 
and the molecular ion was reduced from that of the substrate by two in each case (FIG. 5). 

Examole 10: Identification and Cloning of Two Fatty Acid C. eleaans 
15 Desaturase Genes 

Two high-scoring open reading frames were identified during a search of the C. 

elegans genomic DNA database with the borage A^-desaturase protein sequence. Both 

proteins predicted from these open reading frames. W08D2.4 and T13F2.1, contained an N- 

fermihal sequence resembling cytochrom'e'bsrirrctud ing'the"characteristic"(H PGG)'heme 

20 binding domain, and an H -h- Q substitution in the third histidine box. The W08D2.4 gene 
was denoted fat-3 and the T13F2.1 gene was denoted fat-4, since they both appear to 
encode fatty acid desaturases. Interestingly, the faf-3 and fat-4 genes are located next to 
each other on overlapping cosmids in the same 5* to 3' orientation, with only 858 nucleotide 
base pairs separating the putative polyadenylation signal of the faM gene and the ATG start 

25 codon of the fat-S gene (FIG. 8). This gene organization is reminiscent of operons, in which 
two or more genes are transcribed under the control of a single promoter and regulatory 
region. 

In C. elegans the polycistronic pre-mRNA is converted to monocistronic mRNA by 
cleavage and polyadenylation at the 3* end of the upstream gene and transplicing to the SL2 

30 sequence at the 5' end of the downstream gene, with the two mRNAs being subsequently 
independently translated. However, out of more than 30 such operons that have been 
analyzed, the distances between the 3* end of the upstream gene and the 5* end of the 
downstream gene are generally about 100 base pairs, with a few separated by 300-400 base 
pairs (Blumenthal et al, C. elegans II, pp. 117-145. Cold Spring Harbor Laboratory Press, 

35 Cold Spring, NY, 1997). 

The C, elegans fat'3 and fat-4 genes were tested to determine whether they were 
trans-spliced to either SL1 or SL2 in order to detemiine if they might be co-transcribed in a 
single operon. It was found that the fat-4 gene was transpliced to SL1 , but that the fat-3 
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gene was not transpiiced to either spliced leader sequence. Therefore, it was concluded 
that each gene contains its own 5' promoter and regulatory region. 

Both of these genes were cloned using RT-PCR. The fat-4 gene sequence matched 
the T13F2 genomic sequence exactly. However the gene product encoded by the cDNA 
5 was seven amino acids shorter than predicted by Genefinder for T13F2.1 (GenBank 

accession number 281 122) because the DNA sequence encoding amino acid residues 198- 
204 was not present in the fat-4 cDNA. The resulting peptide length was 447 amino acids 
instead of the previously predicted 454 amino acids. The gene product encoded by the fat'3 
cDNA also matched the genomic sequence of W08D2.4 (GenBank accession number 
10 Z70271) perfectly. However, the gene product was also shorter than the predicted protein 
sequence. Codons for amino acid residues 38-67 of W08D2.4 were not present in the 
cDNA. In both cases the gene-prediction software used in the genomic sequencing project 
appeared to have misidentified some intron DNA as coding sequences. 

5 

15 Example 11: Sequence Comparisons for C. eleaans A 

5 

The C. elegans FAT-3 and FAT-4 proteins, the Mortierella alpina A -desaturase, and 

6 

the B. officinalis A -desaturase appear to be proteins of similar structure in that they all 
contain^an N-terminal cytochrome dg-domainrthree histidine boxesrand-distinct-hydrophobic- 
membrane-spanning domains predicted by the TMHMM program from the Center for 
20 Biological Sequence Analysis, Technical University of Denmark 

{http://www.cbs.dtu. dk/services/TMHMM-1.0/). The predicted structure is consistent with the 
proposed desaturase structural model (Stukey etal,, J, BioL Chem,, 265:20144-20149. 
1990). Despite these similarities, the overall sequence identity among the four proteins is 

6 6 

quite low. For example, the FAT-3 A -desaturase and the borage A -desaturase share only 
25 28% identity on the amino acid level. The faf-4 gene product shares 25% amino acid identity 

6 5 

with the borage A -desaturase and 19% amino acid identity with the Mortierella alpina A - 
desaturase. Indeed the only portion of the FAT-4 protein that shows extended homology to 

5 

the M. alpina A -desaturase is a sequence of 36 residues incorporating the third His box 
which has 44% identity and 56% similarity. The most closely related pair of sequences are 
30 fat-3 and fat-4, which are 46% identical on the amino acid level and 54% identical over the 
entire cDNA sequence. 

6 

FIG. 9 shows the sequence comparison of the borage A -desaturase, C. elegans 

FAT-3, C. elegans FAT-4, and the Mortierella alpina A^-desaturase. The similar heme- 
binding domains (HPGG) and the three histidine box regions are underlined. The presence 
35 of these conserved motifs indicate that the fat-4 gene may encode a desaturase or a related 
fatty acid modifying enzyme. However it is not possible, from these sequence comparisons 
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6 5 

alone, to predict whether this gene encodes a A -desaturase, a A -desaturase, or a more 
distantly related enzynne. 

Exannple 12: Fatty Acid Desaturase Activity and Substrate Specificity 
5 in Yeast for C. eleaans A 

To determine the enzynnatic activity of the FAT-4 desaturase-like protein, the protein 
was expressed in Saccharomyces cerevisiae supplemented with polyunsaturated fatty acid 
substrates that are not normally present in this yeast. The FAT-4 protein was expressed in 
the yeast expression vector pYES2 from the GAL1 promoter by growing the cells in the 
1 0 presence of galactose and various fatty acids. After 16 hours of growth, the cells were 

analyzed for total fatty acid composition by gas chromatography (GC). Comparison of cells 

8,11.14 

supplemented with di-homo-y-linolenic acid (20:3A ) carrying pYES2 containing the fat-4 
coding sequence and cells carrying the vector alone revealed the presence of a major new 
peak eluting at 14.49 minutes in the cells expressing FAT-4 (FIG. 10B). The novel peak had 

^_ 5.8.11.14 

15 a retention time identical to that of the authentic arachidonic acid methyl ester (20:4A ), 

5.8,11,14 

and was determined to be arachidonic acid (20:4A ) because its mass spectrum was 
identical to that of authentic arachidonic acid methyl ester, including a mass ion peak at m/z 

318: 

The identity of this compound was further verified by converting the yeast fatty acid 

20 methyl esters into oxazoilne derivatives in order to produce structure specific mass spectra 
which simplify the determination of double-bond positions in a hydrocarbon chain. The mass 
spectrum of the DMOX derivative of the novel 20:4 component was consistent with the 
published spectrum for arachadonic acid and contained a prominent peak at m/z 153, which 

5 

is diagnostic of a double bond at the A position. Therefore, it was concluded that the faM 

5 

25 gene encodes a A -desaturase capable of synthesizing arachidonic acid from the substrate 
di-homo-y-linolenic acid. In contrast, the FAT-4 protein showed no activity when linoleic acid 

9.12 9,12,15 

(18:2A ) or y-linolenic acid (18:3A ) were provided as substrates, indicating an 

6 

absence of A "desaturase activity. 

Further analysis of the GC trace of the total fatty acids of the yeast cells expressing 
30 fat-4 revealed the presence of a second novel peak eluting at 12.91 minutes v^ich was not 
present in the empty vector control cells. Analysis of the mass spectrum of this novel peak 
revealed a molecular ion species of 294, identical to that of a methyl ester of an 18-carbon 
fatty acid with two double bonds (18:2), but its retention time and mass spectrum were not 

9.12 

identical to the common isomer 18:2 A 

5 

35 In microsomal extracts of mammalian liver, A -desaturase activity has been reported 

to act on a number of 18 and 20-carbon precursors to produce uncommon fatty acids such 
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as 18:2A^*^\ 20:3A^'^^*^* and 20:4A^'^^'^^*^^ (28, 29), Two species of slime molds have also 

5,9 5,11 5.11.14 ^ 5.11.14.17 

been reported to produce small amounts of 18:2A . 18:2A , 20:3A and 20:4A 
(Rezanka, Phytochemistry, 33:1441-1444. 1993). These fatty acids are unusual in that their 
double bonds do not follow the conventional methylene-interrupted pattern (one double bond 
5 every three carbons). 

Therefore, it was suspected that the novel peak exhibited on the GC spectrum is a 

5 9 11 

result of the C. elegans A -desaturase acting on 18:1 A [or 18:1 A . which in S, cerevisiae 

5.9 

constitutes 15-20% of the total 18:1] compound, to produce the uncommon isomer 18:2A 

5 11 

or 18:2A * . These yeast fatty acid methyl esters were converted into oxazoline derivatives. 
1 0 It was found that the mass spectrum of the DMOX derivative of the novel 18:2 component 
contained the •specific peak at m/z 153. However the larger ion peaks characteristic of 

9 11 

double bonds at the A or A position were not detected due to the small amount of this 
molecule present in the total yeast extracts. 

5 

To test if the C. elegans A -desaturase was capable of desaturating other substrates 
15 to produce other uncommon, non-methylene-interrupted fatty acids, the yeast expressing the 

5 11 

FAT-4-desaturase was supplemented with unconventional A -substrates such as 20:1 A , 
20:2A^^*^*, and 20:3A^^^^^'^^. No novel peaks were detected when the substrate 20:1 A^^ was 
fed to yeast. However, when 20:2A^^'^* and 20:3A^^'^^*^^ were provided as substrates, novel 
peaks were detected eluting at 14.62 minutes and 14.69 minutes, respectively (FIGS. 11A 
20 and 1 1 B, respectively). The mass-spectrum analysis of DMOX derivatives of these 

^^^^^ ^ 5,11.14,17 

molecules yielded results consistent with published values for 20:3A and 20:4A 

5 

including a prominent ion peak of m/^ 153 (which is diagnostic of double bonds at the A 
position). It was found, however, that these fatty acids were not produced to the same 
extent as arachidonic acid (20:4A^'^'^^'^'*) (FIG. 12). in these experiments. 55% of 
25 exogenously fed di-homo-y-linolenic acid (20:3A^*^^*^'*) was converted to arachidonic acid, 
while only 5%, 27%, and 26% of the 18:1. 20:3A^^'^^ and 2G:2A^^'^'*'^^ substrates were 
converted (FIG. 12). 

The fat-3 gene was expressed in the yeast expression vector pMK195 containing 
the constitutive ADH promoter The FAT-3 protein was able to desaturate linoleic acid 
30 (18:2A^'^^) into y-iinolenic acid (18:3A^'^*^% in agreement with published results (Napier et 
al., Biochem. J. 330:61 1-614, 1998). It was also found that FAT-3 was capable of 

9.12,15 6,9.12.15 .... 

desaturating a-linolenic acid (18:3A ) to 18:4A , a common reaction in animals. 
The FAT-3 protein showed no activity on 20:1 A^\ 20:2A^^'^^ 20:3A^'^^'^'*. or 20:3A^^'^^'^^. 
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Therefore, the substrate specificities of the C. elegans and A®-desaturases were 
determined to be specific and non-overlapping. 

Example 1 3: Discussion of the E. gracilis A° -desaturase. 
5 Desaturation at the A^-position has not been reported for any previously cloned gene 

(Tocher et a!.. Prog. Upid Res. 37:73-1 17, 1998). 

The predicted EFD1 protein has 33% amino acid identity with both FAT-3 and FAT- 
4, (FIG. 9), while its identity with the cloned borage A®-desaturase is 28%. The highest 
sequence conservation is found in the His-box motifs which are critical for desaturase 

1 0 activity, most likely because they serve as the diiron-oxo component of the active site 
(Shanklin et al., Biochemistry 33:12787-12794, 1994). Sequence conservation is also 
evident in the N-terminal cytochrome bs-like domain, where most of the essential residues of 
cytochrome bs (Lederer. Biochimie 76:674-692, 1994) that are retained in FAT-3 and FAT-4 
are also present in the EFD1 protein (FIG. 3). 

1 5 Expression of the EFD1 gene in yeast was used to characterize its activity. Three 

different 20-carbon substrates with double bonds at the A^^-position were desaturated (FIG. 
4), and analysis of the products indicated that, for each one, desaturation had taken place at 

the-A-position-(FIG.-5)._The_cloned Eu glena desaturase showed a clear preference for the 

substrates of metabolic significance with greater than two-fold preference for 20:2 and 20:3 

20 over 20:1 (Ulsamer et al., J. Cell BioL 43:105-114, 1969). Even though EFD1 is quite similar 
to other microsomal desaturases, its activity was specific, as evidenced by its inactivity on 
substrates for A^ and A® desaturation (FIG. 12). 

The 20-carbon substrates for A^ desaturation are available in abundance In 
heterotrophically grown E. gracilis (FIG. 2). These same substrates also are available in 

25 mammals, since 20:2 and 20:3 are produced by elongation from 18:2 and 18:3, in 

competition with the typical A^ desaturation (FIG. 1 ). Labeling experiments with rat liver 
homogenates indicate that elongation of the 18-carbon fatty acids is five-fold more rapid than 
the competing desaturation (Pawlosky et al., J. Lipid Res, 33:1711-1717, 1992). 

Implicit in the current understanding of the A® pathway of 20-carbon polyunsaturated 

30 fatty acid biosynthesis is a reliance on alternating desaturation and elongation to control flux 
through the pathway. While elongation often appears to be non-specific, most desaturations 
are specific both as to chain length of the substrate and to existing desaturation pattem of 
the fatty acid (Heinz, Lipid Metabolism in Plants, pp. 33-89. 1993). However, data from 
experiments in mammalian tissue (Bernert and Sprecher, Biochim. Biophys. Acta. 398:354- 

35 363, 1975; Albert et al.. Lipids 14:498-500, 1979) and with yeast expressing the C. elegans 
A^ -desaturase gene (FIG. 12), indicate that A^ enzymes desaturate fatty acids having a 
double bond at the A^^-position but not at the A^-position, producing the non-methylene- 
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interrupted 20:3 and 20:4 compounds at significant rates. For the A® pathway, A® 
desaturation occurs in competition with A^ activity on the substrates 20:2 and 20:3. In spite 
of this promiscuity of A^ enzymes, lipid profiles of mammalian tissue do not contain fatty 
acids with the A^*^^ (Heinz, Lipid Metabolism in Plants, pp. 33-89, 1993; and Ulsamer et al., 
5 J, Cell Biol 43:105-1 14, 1969) desaturation pattern, nor are they seen in Euglena, 

One explanation may be that A^-desaturation of the common substrates occurs very 
rapidly, while A^-desaturation proceeds more slowly, so that little A^'^^ product is formed. In 
support of this explanation, the Euglena A® appears to be a very active desaturase when 
expressed in yeast (FIG. 4), compared to similarly expressed A^ -desaturase enzymes. The 

1 0 Euglena desaturase must be sufficiently active to account for all the long chain 

polyunsaturates of rapidly growing Euglena cultures (FIG. 2). In contrast, the observed rates 
of A®-desatu ration in mammalian tissue are relatively slow. The highest apparent rate 
occurs in cancerous tissues without A® activity, where A® desaturation permits production of 
arachidonic acid at only 17% of the level of comparable normal cells with A® activity 

15 (Grammatikos et al.. Br. J. Cancer 70:2^9-227, 1994). 

Alternatively, the lack of A^ " unsaturated fatty acids in membranes could be 
explained if the A® desaturase accepts these fatty acids for desaturation. The convention 

that-A--desaturation-preeedes-A-aGtivity-(FIGv-1)-is-based on observations-of-desaturation 

reactions proceeding sequentially along the fatty acid hydrocarbon chain. The reverse order 

20 of desaturation, with A®- saturation preceding A®-desaturation, has been claimed (Takagi, J. 
Chem. Bull. Japan 38:2055-2057, 1965) and rejected (Schlenk et al., Lipids 5:575-577, 
1970) in mammalian liver, and proposed as a likely pathway based on experiments with 
deuterated substrates in glioma cells (Cook et al., J. Lipid Res. 32:1265-1273, 1991). 

In Euglena the products of A^-desatu ration. 20:3A®'^^-^'* and 20:4A®-^^-^'^ ''^ may be 

25 incorporated directly into membranes, or subjected to desaturation at the A^-position to 

produce arachidonic and eicosapentaenoic acids (Hulanicka et ai., J. Biol. Chem. 239:2778- 
2787, 1964). Further elongation and desaturation leads to several polyunsaturated 22- 
carbon fatty acids (FIG. 2). In mammals, whether the products are derived from A^ or A® 
activity, similar processes produce mostly arachidonic, eicosapentaenoic, and 

30 docosahexaenoic acid (22:6 (Hwang, Fatty Acids in Foods and Their Health Implications, pp. 
545-557. 1992; Bemert and Sprecher, Biochim. Biophys. Acta 398:354-363, 1975; Lees and 
Korn, S/oc/)e/7)/sf/y 5:1475-1481, 1966; Albert and Coniglio, Biochim. Biophys. iAcfa 489:390- 
396, 1977; Bardon et aL, Cancer Lett. 99:51-58, 1996; and Sprecher and Lee. 
Biochim. Biophys. Acta. 388:113-125. 1975), although some 20:3 is metabolized directly to 

35 series 1 eicosanoid metabolic regulators (Hwang, Fatty Acids in Foods and Their Health 
Implications, pp. 545-557. 1992). 



wo 00/34439 



PCT/US99/286S5 



-33- 

It is interesting to note that the alternate pathway of A®-desaturation begins with an 
elongation step. This elongation is the standard pathway in Euglena, which produces 
substantial amounts of 20:2 (7.4%) and 20:3 (1.4%) (FIG. 2). In mammalian tissue with little 
or no A® activity (Grammatikos et al., Br, J. Cancer 70:2^9-227, 1994), this would be the first 
5 step by which the essential fatty acids 18:2 and 18:3 are metabolized to their 20-carbon 
derivatives. Recently there has been a new emphasis on fatty acid chain elongation acting 
as a regulatory step in fatty acid biosynthesis (Garcia et al., Lipids 25:211-215, 1990; 
Sprecher et aL. Prostag. Leukot Essent Fatty Acids 52:99-101 , 1995). Evidence that breast 
cancer cells may selectively elongate 18:3 in preference to 18:2, and that A®-desatu ration 

10 follows this elongation (Bardon et al., Cancer Lett. 99:51-58, 1996) implies that A®- 
desaturation may play an important role in some cancer cells. 

The identification and cloning of a A®-desaturase gene pemiit examinations of the 
alternate pathway for biosynthesis of 20-carbon polyunsaturated fatty acids and will give 
insight into the possible mechanisms of A^-desaturation. In mammals, operation of this 

1 5 alternative pathway may be confined to specialized tissues, where the demand for 

polyunsaturated fatty exceeds the supply provided through rate-limiting A® desaturation. The 
pathway may be of greater significance where A® desaturation is reduced or absent. Since 

fatty-acid desaturase-metabolism-is-perturbed-in-many-cell-lines,-both-transformed 

(Grammatikos et al.. Ann, N. Y, Acad. Sci, 745:92-105, 1994) and untransformed 

20 (Rosenthal, Prog. Lipid Res. 26:87-124, 1987), it may be that A® activity is only revealed in 
the absence of A® activity. Alternatively, A^ activity may arise or increase with cell neoplasia. 
The isolation and examination of this A^ gene, and analysis of its substrate specificity, should 
facilitate the determination of the role of A° activity in both normal and cancerous 
mammalian tissue. 

25 

Example 14: Discussion of the C. eleaans -desaturase. 

In this example, we describe a region of the C. elegans genome located at position 

5 6 

4.88 of chromosome IV is described that contains the A - and A - desaturase genes. The 
amino acid sequences encoded by the two genes are 46% identical to each other, and each 
30 contains an N-terminal heme binding domain typical of the electron carrier cytochrome 

and three histidine boxes. Both genes contain the consensus sequence of the third His box 
(QXXHH; SEQ. ID NO. 11) that has so far been shown to be unique to the microsomal 
desaturases involved in double-bond insertion at carbons below position 9. 

Despite these similarities, these two microsomal desaturases show absolutely non- 
35 overlapping substrate specificities. When overexpressed in the yeast Saccharomyces 

6 

cerevisiae, the C. eiegans A -desaturase (FAT-3) specifically acts on two 18-carbon 
substrates, linoleic and y-linolenic acid, and always desaturates in a methyiene-interrupted 
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6 

pattern (one double bond every three carbons). The mammalian A -desaturase system has 
likewise been demonstrated to insert double bonds strictly in a methylene-interrupted pattern 
and to have no activity on 20-carbon substrates (Schmitz et al., Lipids 12:307-313, 1997). 

5 

The C- elegans A -desaturase (FAT-4), in contrast, acts on a number of 20-carbon 
5 substrates, as well as on an endogenous 18:1 fatty acid of yeast, and is capable of inserting 
double bonds in a non-methylene interrupted pattern. 

5,11 5.11.14 5,11 

Non-methylene-intenxipted fatty acids such as 20:2 A , 20:3 A , 18:2 A 

14 

have been detected in mammalian cells by feeding C-labeled substrates to rats raised on a 
fat-deficient diet (Ulman et al., Biochem. Biophys, Acta 248:186-197.1971). However, these 
1 0 fatty acids are considered to be "dead end" metabolites, as they have not been 

demonstrated to serve as precursors to signaling molecules such as prostaglandins, nor are 
they detectable in tissue lipids of rats who are not preconditioned on a fat-deficient diet. (We 
also did not detect these fatty acids in C. elegans lipid extracts.) 

5 

In yeast expressing the C. elegans A -desaturase gene, the amount of substrate 
1 5 converted was greatest for the metabollcally significant substrate 20:3 A^'^^'^"* (FIG. 13). The 
amount of 20:2A^^'^'* and 20:3A^^'^^'^^ that was desaturated less than half the amount of 



-conventional substrate-that-was-desaturated^This-was consistent-with-the^rates_of_ 



desaturation in microsomal extracts of mammalian liver, where the rate of conversion of 
labeled 20:2A^^'^'* to 20:3A^'""^^ is 41% of the rate of conversion of labeled 20:3A°'^^'^^ to 

20 20:4A^*^'^^'^'* (Bemet et al., Biochem. Biophys, Acta 398:354-313, 1975). 

The C. elegans fat-3 and fat'4 genes are present in a gene cluster in the same 5' to 
3* orientation. Yet, unlike other gene clusters of this sort in C. elegans, the downstream faf-3 
gene is not transpliced to SL2, and therefore is unlikely to be co-transcribed with the 
upstream fat-4 gene. The two genes could be located next to each other as a result of an 

25 ancient gene-duplication event. The DNA sequences share 54% identity over the entire 
cDNA coding sequence; however the genes do not share any common intron/exon 
boundaries (FIG. 8). 

5 

This is the first disclosed sequence of a A -desaturase gene from an animal. The 
sequence of the C. elegans 

5 5 

30 A -desaturase is quite distant from the bacterial and fungal A -desaturases that have been 
reported, and this animal sequence should facilitate the search for desaturase-encoding 
sequences from humans and other mammals. Both the A^- and 

A^-desaturases are important regulatory enzymes in humans. They participate in critical 
steps in the pathway to produce precursors for synthesis of hormone-like eicosanoid 
35 molecules from the essential dietary fatty acids, linoleic acid and a-linolenic acid. The 
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activities of these desaturases have been shown to be under homnonal and nutritional 
control, but the mechanism of this control is stiil unknown. 

Certain diseases, such as diabetes, result in low A^-desaturase activity, while HTC 
cells, isolated from an ascites tumor derived from a solid hepatoma, show increased A^- 
5 desaturase activity. The availability of mutational and reverse genetic tools and the 

expanding knowledge of cellular and developmental biology in C. elegans make this an 
attractive system to study the roles of polyunsaturated fatty acids and their metabolic 
products in development, reproduction, and other cellular processes of animals. 

10 Example 15: A Plant Cell Transformed with the A^ and A^ -desaturase 
Genes of the Invention 

Using the methods described herein, A^ - and A° -desaturases of the invention may 

be cloned and expressed in plants to produce plants with enhanced amounts of 20-carbon 

polyunsaturated fatty acids. Such plants provide an inexpensive and convenient source of 

1 5 these important fatty acids in a readily harvestable and edible form. 

For instance, the A^ - and A® -desaturases of the invention can be cloned into a 
common food crop, such as corn, wheat, potato, tomato, yams, apples, pears, or into oil- 
seed plants such as sunflower, rapeseed, soy. or peanut plants. The resulting plant would 
express the appropriate enzyme that would'catal^^the'formation of 20^rbon 

20 polyunsaturated fatty acids. In the case of an oil-seed plant, the seed oil would be a rich 
source of 20-carbon polyunsaturated fatty acids. 

The A^ - and A° -desaturase genes may be cloned and expressed either individually, 
or together in a host plant cell. The corresponding desaturases can be expressed using a 
variety of different control sequences, such as promoters, enhancers, and 3'-termination 

25 sequences. These control sequences can be used to control the expression of each 

desaturase individually. For example, the A^ -desaturase can be cloned such that it is under 
the control of a strong promoter, and the 

A® -desaturase can be cloned such that it is under the control of a weak promoter, thus 
yielding a transgenic plant that expresses more A^ -desaturase than A° -desaturase. 

30 Furthermore, expression can be controlled by operably linking one or more of the desaturase 
genes of interest to a promoter that is activated by exposure of the plant cell to an 
appropriate regulatory agent such as an inducer, repressor, de-repressor or inhibitor agent. 
Such regulation is discussed above. Alternatively, expression of non-contiguous genes may 
be coordinated by linking the expression of a first gene with the expression of an inducer or 

35 de-repressor molecule that induces or de-represses the expression of a second gene. 
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The genes of the invention can be integrated into the genome of a plant (for 
example, by Agrobacterium-mediaied 

T-DNA transfer) or animal (for example, by use of broad host-range retroviruses, e.g., an 
adenovirus vector) so that the and -desaturases of the invention are expressed as part 
5 of the genome. For transgenic plants, the T-DNA vector may be used that would result in 
integration of transgenes (A^ and/or A®) into the host ceil genome. 

For example, the expression of the A^-and A® - desaturases in a plant, such 
as Arabidopsis, can be achieved by constructing a plant transformation vector to introduce 
the cDNA of each desaturase into the plant. The vector can contain a tissue specific 

10 promoter so that the desaturase protein will be expressed during seed development. 

Examples of seed specific promoters include that for phaseolin (van der Geest and Hall, 
Plant MoL Biol, 32:579-88. 1996) or the promoter for napin (Stalberg et al.. Plant Mol. Biol. 
23: 671-83, 1993). Other seed-specific promoters that can be used are those located on the 
genomic BAG clone T24A18 (LOCUS ATT24A18. (1999) 45980 bp Arabidopsis thaliana 

1 5 DNA chromosome 4,ACCESSION # AL035680, NID g4490701 ) of the Arabidopsis genome. 
These promoters regulate seed storage protein expression in Arabidopsis, Other promoters 
which express genes speciftcally in seeds like those described in (Parcy et a!.. Plant Cell 
6:1557-1582. 1994) can also be used. The constructs containing the desaturase coding 

sequence and promoter sequence can then transferred to standard plant transformation T- 

20 DNA vectors, similar to pART27 (Gleave, Plant MoL Biol 20:1203-1207, 1992). pGPTV 
(Becker et al., Plant Mol. BioL 20:1195-7. 1992), or pJIT119 (Guerineau et al., Plant MoL 
BioL 15:127-136, 1992). If the plant is to be transformed with two constructs, i.e. one 
encoding the A®-desaturase and the other encoding the A^-desaturase, then it is preferable 
to choose two different selectable markers so that only double transformants will regenerate. 

25 For example, the vector carrying the A^-desaturase can be constructed such that it contains 
the kanmycin {npt\\) gene, and the vector carrying the A®-desaturase can be constructed 
such that it contains the phosphinothricin {bar) gene. Transformants are then selected on 
media containing kanmycin and phosphinothricin. Transformation of Arabidopsis is readily 
achieved using the Agrobacterium-mediated vacuum infiltration process (Katavic et al., MoL 

30 Gen. Genet 245:363-70, 1994) or the floral dip modification of it (Clough and Bent. Plant J. 
16:735-43, 1998). although several other methods are also commonly used. Transgenic 
progeny will be identified by selection using the appropriate antibiotic or herbicide, either 
kanmycin or phosphinothricin. or both. Since the A® and A^ constructs use different 
selectable markers the double transformants are readily isolated. Plants which survive the 

35 transgenic selection are grown to maturity and their seed harvested. The seeds of 

transformed plants are analyzed by isolation of fatty acid methyl esters followed by gas 
chromatography to determine the fatty acid composition. 
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Plants expressing only the A®-desaturase will desaturate the 20:1 fatty acid that 
occurs naturally in the Arabidopsis seed to 20:2A®*^\ Seed harvested from plants doubly 
transformed with both desaturases will, in addition, convert the 20:2A®'^^ product of the A®- 
desaturase plants to 20:3A^'®'^^ as a result of the expression of the A^-desaturase. These 
5 changes will be easily detected by the fatty acid methyl ester analysis. 

Example 16: A Yeast Cell Transformed with the A^ - and A^ -desaturase Genes of the 
Invention 

The cDNA portion of pJW541 (Wallls and Browse, Arch, Biochem, Biophys. 

1 0 365:307-316, 1999) containing the Euglena A® -desaturase was excised from that plasmid 
with the restriction enzymes EcoRI and Spel. The purified DNA fragment representing the 
insert was ligated into the yeast expression vector pYX232 (R&D Systems. Inc.) that had 
been prepared by digestion with EcoRl and Nhe\, to give compatible sticky ends. (Plasmid 
pYX232 carries the marker conferring yeast prototrophy for tryptophan {TRP1 mutation), and 

1 5 uses the those phosphate isomerase (TPI) promoter for constitutive expression of the 

inserted DNA.) The resulting plasmid, pYX232-541, was introduced into the Saccharomyces 
cerevisiae strain already harboring the A* -desaturase (pYFAT4; Watts and Browse, Arch. 
Biochem. Biophys, 362:175-182, 1999) plasmid that confers yeast prototrophy for uracil 
using a lithiu m acetate transformation procedure (Invitrogen). Transfomnants were selected 

20 simultaneously for uracil and tryptophan prototrophy. Selected colonies arising after the 

transformation were inoculated into yeast minimal medium that also lacked both uracil and 
tryptophan. 

For analysis of activity, separate cultures were supplemented with one of three fatty 
acid substrates provided as sodium salts as described (Wailis and Browse. Arch. Biochem. 

25 Biophys. 365:307-316, 1 999). After overnight culture at 28''C. the cultures were harvested 
by centrifugation and washed. Fatty acid methyl esters were prepared using the standard 
methods described in Miquel and Browse J. BioL Chem. 267:1502-1509, 1992. 

Analysis by gas chromatography indicated that each substrate had been 
desaturated twice. The incorporation of the three substrates varied, with more unsaturated 

30 substrates becoming a greater part of the fatty acid composition of the cells, as seen in other 
experiments (Wailis and Browse, Arch. Biochem. Biophys. 365:307-316, 1999, and Watts 
and Browse, Arch. Biochem. Biophys. 362:175-182, 1999. For the tri-unsatu rated substrate 
20:3A^^*^'* ''^ the 20-carbon fatty acid represented 37% of the total cellular fatty acid, for 
20:2A^^''"* the 20-carbon fatty acid level 21%. and for 20:1 A^^ the 20-carbon fatty acid level 

35 reached only 13%. However, the activities of the desaturases were substantially identical 
against ail three substrates. Between 70 and 72% of the substrate was not converted, and 
17 or 18% underwent a single desaturation by only one of the enzymes. However, for each 
substrate, between 1 1 and 13% of the substrate was desaturated by both enzymes acting in 
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concert to produce a fatty acid with two double bonds more than in molecules of the supplied 
substrate. 



TABLE 2 



Fatty acid 
supplement 


Fatty acid 
uptal<e* 


Unconverted 
substrate # 


One added 
desaturation# 


Doubly desaturated 
product # 


20:3 
(11.14.17) 


37 


71 


17 


11 


20:5 (5,8.11,14,17) 


20:2 
(11.14) 


21 


70 


18 


13 


20:4 (5.8.11.14) 


20:1 
(11) 


13 


72 


18 


11 


20:3(5.8.11) 



*as mass percent of whole cell fatty acids 

#as mass percent of incorporated 20-carbon fatty acids 

The foregoing embodiments and examples are provided only as examples and are 

in no way meant to limit the scope of the claimed i nvention. 

TO It should be apparent to one skilled in the art that the Invention described herein can 

be modified in arrangement and detail without departing from the scope or spirit of the 
invention. We claim alt such modifications. 

The references and publications referred to herein are hereby incorporated by 
reference. 
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1. A purified protein having desaturase activity, and connprising an amino acid 
sequence selected from the group consisting of: 

(a) an amino acid sequence as shown in SEQ. ID NO. 4; 
5 (b) an amino acid sequence that differs from that specified in (a) by one 

or more conservative amino acid substitutions; and 

(c) an amino acid sequences having at least 60% sequence identity to 
the sequences specified in (a) or (b). 

10 2. An isolated nucleic acid molecule encoding a protein according to claim 1. 

3. The isolated nucleic acid molecule of claim 2. comprising a sequence as 
shown in SEQ ID NO: 2. 

15 4. A recombinant nucleic acid molecule, comprising a control sequence 

operably linked to the nucleic acid sequence of claim 2. 

5. A cell transformed with the recombinant nucleic acid molecule of claim 4. 



20 6. A cell transformed with the recombinant nucleic acid molecule of claim 4 and 

a nucleic acid molecule selected from the group consisting of: 

(a) a nucleic acid molecule as shown in SEQ ID NO: 1; and 

(b) a nucleic acid molecule that has 60% sequence identity to the nucleic acid 
molecule shown in (a). 

25 

7. The cell of claim 5. wherein the cell is a plant cell. 

8. An isolated nucleic acid molecule that: 

(a) hybridizes under low-stringency conditions with a nucleic acid probe, the 
30 probe comprising a sequence as shown in SEQ ID NO: 3, and fragments thereof; 

and 

(b) encodes a protein having desaturase activity. 

9. A desaturase encoded by the nucleic acid molecule of claim 8. 



35 
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10. A recombinant nucleic acid molecule, comprising a promoter sequence 
operably linked to the nucleic acid molecule of claim 8. 

11. A cell transformed with the recombinant nucleic acid molecule of claim 10. 

12. A transgenic organism, comprising the transformed cell of claim 11, wherein 
the transgenic organism is selected from the group consisting of plants, bacteria, insects, 
fungi, and mammals. 

13. A specific binding agent that binds to the desaturase of claim 9. 

14. An isolated nucleic acid molecule that: 

(a) has at least 60% sequence identity with a nucleic acid sequence as shown in 
SEQ ID NO: 3; and 

(b ) encodes a protein ha vinq desaturase-activitv 

15. A method of identifying a nucleic acid sequence, comprising: 

(a) hybridizing the nucleic acid sequence to at least 10 contiguous nucleotides of a 
sequence as shown in SEQ ID NO: 3; and 

(b) identifying the nucleic acid sequence as one that encodes a desaturase. 

16. A nucleic acid molecule identified by the method of claim 15. 

17. The method of claim 15, wherein hybridizing the nucleic acid sequence is 
performed under low-stringency conditions. 

18. A desaturase encoded by the nucleic acid molecule of claim 15. 

19. A specific binding agent, that binds the desaturase of claim 18. 
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20. The method of claim 15, wherein step (a) occurs in a PGR reaction. 

21 The method of claim 15, wherein step (a) occurs during a library screening. 

5 22. A method for creating a double bond between two carbons in a fatty acid, 

comprising: 

contacting a fatty acid with at least one purified desaturase of claim 17; and 
allowing the desaturase to create a double-bond between two carbons. 

1 0 23- The method of claim 22. wherein the desaturase is expressed in a 

transgenic organism and the double-bond formation occurs in vivo. 

24. The method of claim 23, wherein the desaturase is expressed in an 
organism selected from the group consisting of eukaryotes and prokaryotes. 



-15- 



25. The method of claim 22, wherein the desaturase is expressed in vitro and 
the double-bond formation occurs in vitro. 



26. The method of claim 22, further comprising expressing a second 
20 desaturase. 



27. The method of claim 26, wherein the second desaturase is selected from 
the group consisting of: 

(a) an amino acid sequence as shown in SEQ. ID NO. 2; 
25 (b) an amino acid sequence that differs from those specified in (a) by 

one or more conservative amino acid substitutions; and 

(c) an amino acid sequences having at least 60% sequence identity to 
the sequences specified in (a) or (b). 
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A-6 pathway 



co-3 fatty acids 



18:3 (9,12,15) --> 18:4 (6,9,12,15) 



+2C ^ Eicosapentaenoate 
20:4 (8, 1 1 , 1 4, 1 7) — > 20: 5 (5,8, 1 1 , 1 4, 1 7) 

+2C ^ 
22:5 (7,10,13,16,19) 



A^ 



(0-6 fatty acids 



18;2_(9, 12)_r==>_1.8:3_(6,9, 1 2)- 



+2C ^ A^ Arachadonate 
20:3 (8,11,14) — > 20:4 (5,8, 11,14) 

+2C ^ 
22:4 (7,10,13,16) 



FIG. lA 



SUBSTITUTE SHEET (RULE 26) 
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A-8 
pathway 



co-3 fatty acids 

18:3 (9,12,15) 

+2C ^ A* Eicosapentaenoate 
20:3 — >20:4 (8,1 1,14,17) —> 20:5 
(11,14,17) (5,8,11,14,17) 

+2C ^ 
22:5 

(77l'07r37r6719)-" 



(D-fatty acids 

18:2 (9,12) 

+2C ^ A^ A^ Arachadonate 

20:2(11,14) — > 20:3(8,11,14) — > 20:4(5,8,11,14) 

+2C ^ 

22:4 (7,10,13,16) 



FIG. IB 
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Retention Time 



PK 


RT 


FA 


% 


PK 


RT 


FA 


1 


10-0 


20:2A11,14 


7.2 


4 


11.7 


20:5 A5,8,l 1,14,17 


6.2 


2 


10.3 


20.3 A8,l 1,14 


6.3 


5 


14.0 


22:4 A7,10,13,16 


2.9 


3 


10.7 


20:4 A5,8,ll,14 


9.0 
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1 MVLREQEHEP FFIKIDGKWC QIDDAVUISH PGGSAITTYK NMDATTVFHT 

51 FHTGSKEAYQ WLTELKKECP TQEPEIPDIK DDPIKGIDDV NMGTFNISEK 

101 RSAQINKSFT DLRMRVRAEG LMDGSPLFYI RKILETIFTI LFAFYIiQYHT 

151 YYliPSAILMG VAWQQLGWLI HEFAHHQLFK NRYYNDLASY FVGNFLQGFS 

2 01 SGGWKEQHNV HHAATNWGR DGDLDLVPFY ATVAEHLNNY SQDSWVMTLF 

251 RWQHVHWTFM LPFLRLSWLL QSIIFVSQMP THYYDYYMIT AIYEQVGLSL 

301 HWAWSLGQIiY FLPDWSTKIM FFLVSHLVGG FLLSHWTFN HYSVEKFALS 

351 SNIMSNYACIi QIMTTRNMRP GRFIDWLWGG liNYQIEHHLF PTMPRHNLNT 

401 VMPLVKEFAA ANGLPYMVDD YFTGFWIiEIE QFRNIANVAA KliTKKIA 



FIG. 6A 
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1 GAATTTTCAA TCCTCCTTGG GTCCCACCGC TGTGATATCA AAATGGTATT 
51 ACGAGAGCAA GAGCATGAGC CATTCTTCAT TAAAATTGAT GGAAAATGGT 
101 GTCAAATTGA CGATGCTGTC CTGAGATCAC ATCCAGGTGG TAGTGCAATT 
151 ACTACCTATA AAAATATGGA TGCCACTACC GTATTCCACA CATTCCATAC 
201 TGGTTCTAAA GAAGCGTATC AATGGCTGAC AGAATTGAAA AAAGAGTGCX: 
251 CTACACAAGA ACCAGAGATC CCAGATATTA AGGATGACCC AATCAAAGGA 
301 ATTGATGATG TGAACATGGG AACTTTCAAT ATTTCTGAGA AACGATCTGC 
351 CCAAATAAAT AAAAGT3TJCA. CTGATCTACG TATGCGAGTT CGTGCAGAAG 
401 GACTTATGGA TGGATCTCCT TTGTTCTACA TTAGAAAAAT TCTTGAAACA 
451 AtCTTCACAA TTCTTTTTGC ATTCTACCTT CAATACCACA CATATTATCT 
501 TCCATCAGCT ATTCTAATGG GAGTTGCGTG GCAACAATTG GGATGGTTAA 
551 TCCATGAATT CGCACATCAT CAGTTGTTCA AAAACAGATA CTACAATGAT 
601 TTGGCCAGCT ATTTCGTTGG AAACTTTTTA CAAGGATTCT CATCTGGTGG 
€51 TTGGAAAGAG CAGCACAATG TGCATCACGC AGCCACAAAT GTTGTTGGAC 
701 GAgACGGAGA TCTTGATTTA GTCCCATTCT ATGC TACAGT GGCAGAACAT- 
751 CTCAACAATT ATTCTCAGGA TTCATGGGTT ATQACTCTAT TCAGATGGCA 
801 ACATGTTCAT TGGACATTCA TGTTACCATT CCTCCGTCTC TCGTGGCTTC 
851 TTCAGTCAAT CATTTTTGTT AGTCAGATGC CAACTCATTA TTATGACTAT 
901 TACAGAAATA CTGCGATTTA TGAACAGGTT GGTCTCTCTT TGCACTGGGC 
951 TTGGTCATTG GGTCaATTGT ATTTCCTACC CGATTGGTCA ACTAAAATAA 
1001 TGTTCTTCCT TGTTTCTCAT CTTGTTGGAG GTTTCCTGCT CTCTCATGTA 
1051 GTTACTTTCA ATCATTATTC AGTGGAGAAG TTTGCATTGA GCTCGAACAT 
1101 CATGTCAAAT TACGCTTGTC TTCAAATCAT GACCACAAGA AATATGAGAC 
1151 CTGGAAGATT CATTGACTGG CTTTGGGGAG GTCTTAACTA TCAGATTGAG 
1201 CACCATCTTT TCCCAACGAT GCCACGACAC AACTTGAACA CTGTTATGCC 
1251 ACTTGTTAAG GAGTTTGCAG CAGCAAATGG TTTACCATAC ATGGTCGACG 
1301 ATTATTTCAC AGGATTCTGG CTTGAAATTG AGCAATTCCG AAATATTCCA 
1351 AATGTTGCTG CTAAATTGAC TAAAAAGATT GCCTAGATTA CGATTAATTA 
1401 ATCAATTTAT TTTCATGTTC TATTCGTGTG TTTTAATATT TTCCAAATTT 
1451 TXACCTATTC C 
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1 MKSKRQALSP LQLMEQTYDV SAWVNFHPGG AEIIENYQGR DATDAEMVMH 

51 FQEAFDKIiKR MPKINPSFEL PPQAAVNEAQ EDFRKLREEIi lATGMFDASP 

101 LWYSYKISTT LGLGVLGYFL MVQYQMYFIG AVI*LGMHYQQ MGWLSHDICH 

151 HQTFKNRNWN NLVGLVFGNG LQGFSVTCWK DRHNAHHSAT NVQGHDPDID 

201 NLPPIiAWSED DVTRASPISR KLIQFQQYYF LVICILLRFI WCFQCVLTVR 

251 SLKDRDNQFY RSQYKKEAIG LALHWTLKAIi FHLFFMPSIL TSLLVFFVSE 

301 LVGGFGIAIV VFMNHYPLEK IGDPVWDGHG FSVGQIHETM NIRRGIITDW 

351 FFGGLNYQIE HHLWPTLPRH NLTAVSYQVE QLCQKHNIiPY RNPLPiffiGIiV 

401 ILLRYIiAVFA RMAEKQPAGK AL 

FIG. 7A 
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1 ATTTTTTTTC GAAATGAAGT CAAAGCGCCA AGCGCTATCC CCCTTACAAT 
SI TGATGGAACA AACATATGAT GTGGTCAATT TCCACCCTGG TGGTGCGGAA 
101 ATTATAGAGA ATTACCAAGG AAGGGATGCC ACTGATGCCT TCATGGTTAT 
151 GCACTTTCAA GAAGCCTTCG ACAAGCTCAA GCGCATGCCC AAAATCAATC 
201 CCAGTTTTGA GTTGCCACCC CAGGCTGCAG TGAATGAAGC TCAAGAGGAT 
251 TTCCG6AAGC TCCGAGAAGA GTTGATCGCA ACTGGCATGT TTGATGCCTC 
301 CCCCCTCTGG TACTCATACA AAATCAGCAC CACACTGGGC CTTGGAGTGC 
351 TGGGTTATTT CCTGATGGTT CAGTATCAGA TGTATTTCAT TGGGGCftGTG 
401 TTGCTTGGGA TGCACTATCA ACAGATGGGC TGGCTTTCTC ATGACATTTG 
451 CCACCACCAG ACTTTCAAGA ACCGGAACTG GAACAACCTC GTGGGACTGG 
501 TATTTGGCAA TGGTCTGCAA GGTTTTTCCG TGACATGTTG GAAGGACAGA 
551 CACAATGCAC ATCATTCGGC AACCAATGTT CAAGGGCACG ACCCTGATAT 
601 TGACAACCTC CCCCCCTTAG CCTGGTCTGA GGATGACGTC ACACGGGCGT 
651 CACCGATTTC CCGCAAGCTC ATTCAGTTCC AGCAGTACTA TTTCTTGGTC 
701 ATCTGTATCT TGTTGCGGTT CATTTGGTGT TTCCAGTGCG TGTTGACCGT 
751 GCGCAGTTTG AAGGACAGAG ATAACCAATT CTATCGCTCT CAGTATAAGA 
801 AGGAGGCCAT TGGCCTCGCC CTGCACTGGA CCTTGAAGGC CCTGTTCCAC 
851 TTATTC-riTA ..G ecCAGCAT-CCTCACATCG-eTGTTGGTGTl^TTTCGTTTC 



901 GGAGCTGGTT GGCGGCTTCG GCATTGCGAT CGTGGTGTTC ATGAACCACT 

951 ACCCACTGGA GAAGATCGGG GACCCA6TCT GGGATGGCCA TGGATTCTCG 

1001 GTTGGCCAGA TCCATGAGAC CATGRACATT CGGCGAGGGA TTATCACAGA 

X051 TTGGTTTTTC GGAGGCTTGA ATTACCAGAT TGAGCACCAT TTGTGGCCGA 

1101 CCCTCCCTCG CCACAACCTG ACAGCGGTTA GCTACCAGGT GGAACAGCTG 

tt 1151 TGCCAGAAGC ACAACCTGCC GTATCGGAAC CCGCTGCCCC ATGAAGG6TT 

^ 1201 GGTCATCCTG CTGCGCTATC TGGCGGTGTT CGCCCGGATG GCGGAGAAGC 

12S1 AACCCGCGGG GAAGGCTCTA TAAGG 



FIG. 7B 
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SEQUENCE LISTING 

<110> Browse, John et al . 

<120> Desaturases and Methods of Using Them for Synthesis of 
Polyunsaturated Fatty Acids 

<130> 53860 

<140> 
<141> 

<150> 60/111,301 
<151> 1998-12-07 

<160> 13 

<170> Patentin Ver. 2.0 

<210> 1 
<211> 1461 
<212> DNA 

<213> Caenorhabditis elegans 
<400> 1 

gaattttcaa tcctccttgg gtcccaccgc tgtgatatca aaatggtatt acgagagcaa 60 

gagcatgagc cattcttcat taaaattgat ggaaaatggt gtcaaattga cgatgctgtc 120 

ctgagatcac atccaggtgg tagtgcaatt actacctata aaaatatgga tgccactacc 180 

gtattccaca cattccatac tggttctaaa gaagcgtatc aatggctgac agaattgaaa 240 

aaagagtgcc ctacacaaga accagagatc ccagatatta aggatgaccc aatcaaagga 300 

attgatgatg tgaacatggg aactttcaat atttctgaga aacgatctgc ccaaataaat 360 

aaaagtttca ctgatctacg tatgcgagtt cgtgcagaag gacttatgga tggatctcct 420 
--ttgttctaca— ttagaaaaa-l^t-Gt^gaaaea—atet-teaeaa— ttrct^^ 

caataccaca catattatct tccatcagct attctaatgg gagttgcgtg gcaacaattg 540 

ggatggttaa tccatgaatt cgcacatcat cagttgttca aaaacagata ctacaatgat 600 

ttggccagct atttcgttgg aaacttttta caaggattct catctggtgg ttggaaagag 660 

cagcacaatg tgcatcacgc agccacaaat gttgttggac gagacggaga tcttgattta 720 

gtcccattct atgctacagt ggcagaacat ctcaacaatt attctcagga ttcatgggtt 780 

atgactctat tcagatggca acatgttcat tggacattca tgttaccatt cctccgtctc 840 

tcgtggcttc ttcagtcaat catttttgtt agtcagatgc caactcatta ttatgactat 900 

tacagaaata ctgcgattta tgaacaggtt ggtctctctt tgcactgggc ttggtcattg 960 

ggtcaattgt atttcctacc cgattggtca actagaataa tgttcttcct tgtttctcat 1020 

cttgttggag gtttcctgct ctctcatgta gttactttca atcattattc agtggagaag 1080 

tttgcattga gctcgaacat catgtcaaat tacgcttgtc ttcaaatcat gaccacaaga 1140 

aatatgagac ctggaagatt cattgactgg ctttggggag gtcttaacta tcagattgag 1200 

caccatcttt tcccaacgat gccacgacac aacttgaaca ctgttatgcc acttgttaag 1260 

gagtttgcag cagcaaatgg tttaccatac atggtcgacg attatttcac aggattctgg 1320 

cttgaaattg agcaattccg aaatattgca aatgttgctg ctaaattgac taaaaagatt 1380 

gcctagatta cgattaatta atcaatttat tttcatgttc tattcgtgtg ttttaatatt 1440 

ttccaaattt ttacctattc c 14 61 

<210> 2 
<211> 447 
<212> PRT 

<213> Caenorhabditis elegans 
<400> 2 

Met Val Leu Arg Glu Gin Glu His Glu Pro Phe Phe lie Lys lie Asp 
15 10 15 

Gly Lys Trp Cys Gin lie Asp Asp Ala Val Leu Arg Ser His Pro Gly 
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20 25 30 

Gly Ser Ala lie Thr Thr Tyr Lys Asn Met Asp Ala Thr Thr Val Phe 
35 40 45 

His Thr Phe His Thr Gly Ser Lys Glu Ala Tyr Gin Trp Leu Thr Glu 
50 55 60 

Leu Lys Lys Glu Cys Pro Thr Gin Glu Pro Glu lie Pro Asp lie Lys 
65 70 75 80 

Asp Asp Pro lie Lys Gly lie Asp Asp Val Asn Met Gly Thr Phe Asn 
85 90 95 

lie Ser Glu Lys Arg Ser Ala Gin lie Asn Lys Ser Phe Thr Asp Leu 
100 105 110 

Arg Met Arg Val Arg Ala Glu Gly Leu Met Asp Gly Ser Pro Leu Phe 
115 120 125 

Tyr lie Arg Lys lie Leu Glu Thr lie Phe Thr lie Leu Phe Ala Phe 
130 135 140 

Tyr Leu Gin Tyr His Thr Tyr Tyr Leu Pro Ser Ala lie Leu Met Gly 
145 150 155 160 

Val Ala Trp Gin Gin Leu Gly Trp Leu lie His Glu Phe Ala His His 
165 170 175 

Gin Leu Phe Lys Asn Arg Tyr Tyr Asn Asp Leu Ala Ser Tyr Phe Val 
180 185 190 



_Gly_Asn_Ehe-Leu-Gln-Gl-y— Phe— Ser^Sej^Gl-y—Gl-y— Torp-^ays^ 
195 200 205 

Asn Val His His Ala Ala Thr Asn Val Val Gly Arg Asp Gly Asp Leu 
210 215 220 

Asp Leu Val Pro Phe Tyr Ala Thr Val Ala Glu His Leu Asn Asn Tyr 
225 230 235 240 

Ser Gin Asp Ser Trp Val Met Thr Leu Phe Arg Trp Gin His Val His 
245 250 255 

Trp Thr Phe Met Leu Pro Phe Leu Arg Leu Ser Trp Leu Leu Gin Ser 
260 265 270 

lie lie Phe Val Ser Gin Met Pro Thr His Tyr Tyr Asp Tyr Tyr Arg 
275 280 285 

Asn Thr Ala lie Tyr Glu Gin Val Gly Leu Ser Leu His Trp Ala Trp 
290 295 300 

Ser Leu Gly Gin Leu Tyr Phe Leu Pro Asp Trp Ser Thr Arg lie Met 
305 310 315 320 

Phe Phe Leu Val Ser His Leu Val Gly Gly Phe Leu Leu Ser His Val 
325 330 335 

Val Thr Phe Asn His Tyr Ser Val Glu Lys Phe Ala Leu Ser Ser Asn 
340 345 350 
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He Met Ser Asn 
355 

Arg Pro Gly Arg 
370 

He Glu His His 
385 

Val Met Pro Leu 



Met Val Asp Asp 
420 



Arg Asn He Ala 
435 



Tyr Ala Cys Leu 
360 



Phe He Asp Trp 
375 

Leu Phe Pro Thr 
390 

Val Lys Glu Phe 
405 

Tyr Phe Thr Gly 



Asn Val Ala Ala 
440 



Gin He Met Thr 



Leu Trp Gly Gly 
380 



Met Pro Arg His 
395 

Ala Ala Ala Asn 
410 

Phe Trp Leu Glu 
425 

Lys Leu Thr Lys 



Thr Arg Asn Met 
365 

Leu Asn Tyr Gin 



Asn Leu Asn Thr 
400 



Gly Leu Pro Tyr 
415 

He Glu Gin Phe 
430 

Lys He Ala 
445 



<210> 3 
<211> 1275 
<212> DNA 

<213> Euglena gracilis 



<400> 3 ^^^^-t-ai-r-r- nr-nt-tacaat tqatggaaca 60 

attttttttc gaaatgaagt caaagcgcca ^9-9=^^^^^ atta^agaga a?tallaagg 120 
aacatatgat gtggtcaatt tccaccctgg ^ggtgcggaa ^^tatag g ^^^^ 
aagggatgcc actgatgcct tcatggttat 5=-=^^^^^^ SggSS tgaaJgaagc 24 0 
gcgcatgccc aaaatcaatc ccagttttga g"gccaccc J ^ ttgatgcctc 300 

Llagaggat ttccggaagc tccgagaaga ^ttgatcgca ^ctggcatgt ^^^^^^ 
-eeeeet-c^gg-tac-tcataca_aaa,t^g cac g^^^^^^^g^ ttgcttggga tgcL t-a«:a-4^Q— 
cctgatggtt cagtatcaga tjtatttcat tggggcagtg ttgcttggg^ .^ggaactg 480 
acagatgggc tggctttctc ^^gacatttg J tgacatgttg 540 

gaacaacctc gtgggactgg tatttggcaa ^^^^ctgca accctgatat 600 

gaaggacaga cacaatgcac atcattcggc ^accaatgtt ^^^^^^"^^ caccgatttc 660 
?gacaacctc ccccccttag cctggtctga g^^^gacgtc J^^^^^^^^t ^ ^ ,20 
ccgcaagctc attcagttcc agcagtacta "tcttggtc ^tctgtatct 9 ^^^^^^ 
catttggtgt ttccagtgcg tgttgaccgt gcg--^^^^ ^J^JJ^^^^^ ccttgaaggc 840 
ctatcgctct cagtataaga ^ggaggccat tggcctcgcc ttttcgtttc 900 

cctgttccac ttattcttta tgcccagcat ^^tcacatcg ^^^^^^g 9 ^^^^^^tgga 960 
ggagctggtt ggcggcttcg g^attgcgat -^tggtgttc ^tgaaccact ^^20 
gaagatcggg gacccagtct gggatggcca tggattctcg ^"^^ ^ attaccagat 1080 

ccS-s c fe ,=»=-f ..JO 

T^^ii r,rrt?" f/=rc^sr, ^r.r.aS= 

gaaggctcta taagg 



<210> 4 
<211> 422 
<212> PRT 

<213> Euglena gracilis 



^"^^^^ ^ T no Ala Leu Ser Pro Leu Gin Leu Met Glu Gin 

Met Lys Ser Lys Arg Gin Ala Leu ber rru 

1 ^ 
Thr Tyr Asp Val Ser Ala Trp Val Asn Phe His Pro Gly Gly Ala Glu 



20 25 



3 



wo 00/34439 PCT/US99y28655 

lie lie Glu Asn Tyr Gin Gly Arg Asp Ala Thr Asp Ala Phe Met Val 
35 40 45 

Met His Phe Gin Glu Ala Phe Asp Lys Leu Lys Arg Met Pro Lys lie 
50 55 60 

Asn Pro Ser Phe Glu Leu Pro Pro Gin Ala Ala Val Asn Glu Ala Gin 
65 70 75 80 

Glu Asp Phe Arg Lys Leu Arg Glu Glu Leu lie Ala Thr Gly Met Phe 
85 90 95 

Asp Ala Ser Pro Leu Trp Tyr Ser Tyr Lys He Ser Thr Thr Leu Gly 
100 105 110 

Leu Gly Val Leu Gly Tyr Phe Leu Met Val Gin Tyr Gin Met Tyr Phe 
115 120 125 

He Gly Ala Val Leu Leu Gly Met His Tyr Gin Gin Met Gly Trp Leu 
130 135 140 

Ser His Asp He Cys His His Gin Thr Phe Lys Asn Arg Asn Trp Asn 
145 150 155 160 

Asn Leu Val Gly Leu Val Phe Gly Asn Gly Leu Gin Gly Phe Ser Val 
165 170 175 

Thr Cys Trp Lys Asp Arg His Asn Ala His His Ser Ala Thr Asn Val 
180 185 190 

Gin Gly His Asp Pro Asp He Asp Asn Leu Pro Pro Leu Ala Trp Ser 
195 200 205 



Glu Asp Asp Val Thr Arg Ala Ser Pro He Ser Arg Lys Leu He Gin 
210 215 220 

Phe Gin Gin Tyr Tyr Phe Leu Val He Cys He Leu Leu Arg Phe He 
225 230 235 240 

Trp Cys Phe Gin Cys Val Leu Thr Val Arg Ser Leu Lys Asp Arg Asp 
245 250 255 

Asn Gin Phe Tyr Arg Ser Gin Tyr Lys Lys Glu Ala He Gly Leu Ala 
260 265 270 

Leu His Trp Thr Leu Lys Ala Leu Phe His Leu Phe Phe Met Pro Ser 
275 280 285 

He Leu Thr Ser Leu Leu Val Phe Phe Val Ser Glu Leu Val Gly Gly 
290 295 300 

Phe Gly He Ala He Val Val Phe Met Asn His Tyr Pro Leu Glu Lys 
305 310 315 320 

He Gly Asp Pro Val Trp Asp Gly His Gly Phe Ser Val Gly Gin He 
325 330 335 

His Glu Thr Met Asn He Arg Arg Gly He He Thr Asp Trp Phe Phe 
340 345 350 

Gly Gly Leu Asn Tyr Gin He Glu His His Leu Trp Pro Thr Leu Pro 
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365 



Arg His Asn Leu Thr Ala Val Ser Tyr Gin Val Glu Gin Leu Cys Gin 
370 375 380 

Lys His Asn Leu Pro Tyr Arg Asn Pro Leu Pro His Glu Gly Leu Val 
385 390 395 400 

lie Leu Leu Arg Tyr Leu Ala Val Phe Ala Arg Met Ala Glu Lys Gin 
405 410 415 

Pro Ala Gly Lys Ala Leu 
420 



<210> 5 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PGR Primer 
<220> 

<221> variation 
<222> (1) . , (27) 
<223> y = t or c 

<220> 

<221> variation 
<222> (1) . . (27) 
<223> n= a, t^ c, or g 



<220> 

<221> variation 
<222> (1) . . (27) 
<223> r a or g 

<400> 5 

ggctggctga cncaygartt ytgycay 27 

<210> 6 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PGR Primer 
<220> 

<221> variation 
<222> (1) . . (30) 
<223> n = a, t, q, or c 

<220> 

<221> variation 
<222> (1) . . (30) 
<223> r = a or g 

<220> 

<221> variation 
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<222> (1) . . (30) 
<223> y = t or c 



<400> 6 

catcgttgga aanarrtgrt gytcdatytg 



30 



<210> 7 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PGR Primer 
<400> 7 

cccgggaagc ttctcgagga attttcaatc ctccttgggt c 41 

<210> 8 
<211> 34 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PGR Primer 



<210> 9 
<211> 6 
<212> RNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: 
Polyadenylation Signal 

<400> 9 

aauaaa 6 

<210> 10 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PGR Primer 



<210> 11 
<211> 9 
<212> RNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Histidine box 
<400> 11 

uuuuuuucg 9 



<400> 8 

cccgggtgga tccggaacat atcacacgaa acag 



34 



<400> 10 

tctgggatct ctggttcttg 



20 
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<210> 12 
<211> 5 
<212> PRT 

<213> Axtificial Sequence 
<220> 

<223> Description of Artificial Sequence: Histidine Box 
<220> 

<221> VARIANT 

<222> (1) . . (5) 

<223> Xaa = any amino acid 

<400> 12 

His Xaa Xaa His His 
1 5 



<210> 13 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Histidine Box 
<220> 

<221> VARIANT 

<222> (1) . . (5) 

<223> Xaa = any amino acid 

-<-4 0 Q->— 1-3 ^ " 

Gin Xaa Xaa His His 
1 5 
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